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COMPOSITIONS AND METHODS FOR THE HIGH EFFICIENCY EXPRESSION OF 
THE TRANSFORMING GROWTH FACTOR-BETA SUPERGENE FAMILY 



CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims priority to United States Provisional Patent Application 
Serial No. 60/534,379, filed January 6, 2004 and 60/575,839, filed June 2, 2004. 

GOVERNMENTAL RIGHTS 

[0002] This invention was funded by the National Institute of Allergy and Infectious 
Diseases at the National Institutes of Health. The United States Government has certain rights to 
this invention. 

FIELD 

[0003] The present invention relates to compositions and methods for the high 
efficiency production of the transforming growth factor-P (TGFp) supergene family members. 

BACKGROUND 

[0004] Transfonning growth factor-P 1 (TGFpi) was first isolated from human placenta 
in 1983 (Frolik etaL, 1983, Proc. Natl Acad. Sci. U. S. A 80:3676-3680). There are currently 
five distinct isoforms of TGFp sharing 64-82% identity; three of the five, TGFpi, -p2 and -p3, 
are expressed in mammalian tissues (Govinden et al, 2003, Pharmacol Tlier. 98:257-265). 
TGFpi belongs to a supergene family of over 40 different related proteins, presxuned to be 
derived from a common ancestor gene (Kingsley et al^ 1994, Geties Dev. 8:133-146). TGFpi is 
the prototype of the supergene fanwly. 
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[0005] Although TGFpl was initially described as a factor that caused rat kidney 
fibroblasts to proliferate (Roberts et al, 1981, Proc. Natl Acad. ScL U. S. A. 78:5339-5343), it is 
now known that TGFpi is a multifunctional cytokine with both stimulatory and inhibitory 
effects on a wide range of cells (Lawrence, 2001, MoL Cell Biochent. 219:163-170). The TGFps 
1, 2, and 3 are involved in normal tissue development such as wound healing, angiogenesis, 
hematopoiesis, mammary gland development, bone metaboUsm, and skin formation. The TGFps 
1, 2, and 3 are also associated with multiple pathologies such as inflammatory and fibrotic 
diseases and tumor development (Gleizes et al, 1997, Stem Cells 15:190-197). Signaling of the 
TGFps is accompUshed through binding of the molecules to type I and type H receptors 
(serine/threonine kinases) on the cell surface. The binding allows receptor n to phosphorylate 
the receptor I kinase domain, which then propagates the signal through phosphorylation of the 
Smad proteins (Shi et al, 2003, Cell 113:685-700). 

[0006] TGFpl is synthesized in cells as a 390-amino acid precursor protein composed 
of a typical leader peptide and a pro-TGFP 1 , The precursor protein undergoes a number of 
intracellular processing steps prior to secretion from the cell. One processing step is the 
proteolytic digestion of the precursor protein between amino acid residues 278 and 279 by the 
endopeptidase fiirin, yielding an N-tenninal portion that is the 65-75 kDa latency-associated 
peptide (LAP), and yielding a C-terminal portion that is the 25-kDa mature TGFpl (Blanchette 
etal, 1991, J. Clin, /rtves/ 99:1974-1983; Dubois era/., 1995, J. BioL Chem. 270:10618-10624). 
After the cleavage of the precursor, the mature TGFp 1 is homodimerized and remains 
noncovalently associated with the LAP until activation (Khahl, 1999, Microbes. Infect, 1:1255- 
1263). 

[0007] LAP, an important component of TGFpl, is required for efficient secretion of 
the cytokine, keeps the cytokine in an inactive form, prevents the cytokine from binding to 
ubiquitous cell surface receptors and maintains the availability of TGFp 1 in a large extracellular 
reservoir that is readily accessed by activation. Most cells release latent TGFpi as a larger 
complex, in which TGFpl is associated with a 120-240-kDa glycoprotein named latent TGFpl 
binding protem (LTBP). 

[0008] TGFpi was initially purified from human platelet with extremely low yield of 
40|xg from 800-1000 hters of human blood (Miyazono et al., 1988, J, Biol Chem. 263:6407- 
6415). Due to its ftmctional importance, attempts have been made to produce recombinant 
TGFpi in both bacteria and mammalian e>q>ression systems. Although attempts to produce 
TGFpi in mammalian cells have been successful, yields have been low (Gentry et al, 1988, Mol 

-2- 



wo 2005/068499 PCT/US2005/000378 
CellBiol. 8:4162-4168; Gentry e^fl/., 1987, Mo/. CellBioL 7:3418-3427; Bourdrel a/., 1993, 

Protein Expr. Pvrif. 4:130-140; Archer et al, 1993, Biochemistry 32:1 152-1 163). Additionally, 

recombinant systems employed to date have required lengthy purification protocols that further 

compromised the yield to less than 1-2 mg of recombinant protein per liter of harvest medium. 

SUMMARY 

[0009] The present invention relates to compositions and methods for the high 
efficiency production of the transforming growth factor-p (TGFp) supergene family members. 

[0010] In one aspect, the invention relates to an isolated mammalian TGFp-encoding 
nucleic acid molecule comprising (a) a pro-TGPp polynucleotide encoding a mammalian pro- 
TGFP polypeptide, wherein the pro-TGFp polynucleotide does not encode a cysteine residue 
within the first ten amino acid residues of tiie pro-TGFp polypeptide; and (b) a signal 
polynucleotide encoding a heterologous signal polypeptide, wherein the polynucleotide is in 
firame with the pro-TGFp polynucleotide. 

[0011] In another aspect, the invention relates to an isolated mammalian TGFp- 
encoding nucleic acid molecule comprising (a) a pro-TGFp polynucleotide encoding a 
mammalian pro-TGFp polypeptide, wherein the pro-TGFp polynucleotide does not encode a 
cysteine residue within the first ten amino acid residues of the pro-TGFp polypeptide; and (b) a 
signal polynucleotide encoding a heterologous signal polypeptide, wherem the polynucleotide is 
in frame with the pro TGFp polynucleotide, and wherein the pro-TGFp polynucleotide encodes a 
mammaKan pro-TGFp polypeptide comprising a mature TGFp portion and a LAP portion, and 
wherein the mature TGFp portion is 95% identical in nucleotide but preferably 100% identical in 
polypeptide to a mature human TGFp. 

[0012] In another aspect, the invention relates to an isolated nucleic acid molecule 
comprising a pro-TGFp polynucleotide, a signal polynucleotide, and a tag polynucleotide 
encoding a pmification tag polypeptide, wherein the polynucleotide encoding the purification tag 
polypeptide is located between, and in frame with, the signal polynucleotide and the 
polynucleotide ^coding the pro-TGFp polypeptide. 

[0013] In another aspect, the invention relates to isolated eukaryotic cell lines 
comprising an isolated mammalian TGFp-encoding nucleic acid molecule fiirfher comprising (a) 
a pro-TGFp polynucleotide encoding a mammalian pro-TGFp polypeptide, wherein the pro- 
TGFp polynucleotide does not encode a cysteine residue within the first ten amino acid residues 
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of the pro-TGFp polypeptide; and (b) a signal polynucleotide encoding a heterologous signal 
polypeptide, wherein the polynucleotide is in frame with the pro TGFp polynucleotide. 

[0014] In another aspect, the invention relates to vectors and expression vectors 
comprising an isolated mammalian TGFP -encoding nucleic acid molecule further comprising (a) 
a pro-TGFp polynucleotide encoding a mammalian pro-TGFp polypeptide, wherein the pro- 
TGFp polynucleotide does not encode a cysteine residue within the first ten amino acid residues 
of the pro-TGFp polypeptide; and (b) a signal polynucleotide encoding a heterologous signal 
polypeptide, wherein the polynucleotide is in frame witii the pro TGFP polynucleotide. In some 
aspects, the nucleic acid is operatively linked to the regulatory sequence in an antisense 
orientation in the expression vector. In other aspects, the nucleic acid is operatively linked to the 
regulatory sequence in a sense orientation in the expression vector. 

[0015] In another aspect invention relates to a host cell comprising an isolated 
mammalian TGFp-encoding nucleic acid molecule ftirther comprising (a) a pro-TGFp 
polynucleotide encoding a mammalian pro-TGFP polypeptide, wherein the pro-TGFp 
polynucleotide does not encode a cysteine residue within the first ten amiao acid residues of the 
pro-TGFp polypeptide; and (b) a signal polynucleotide encoding a heterologous signal 
polypeptide, wherein the polynucleotide is in frame with the pro TGFp polynucleotide. In some 
aspects, the host cell of is a eukaryote. 

[0016] In another aspect invention relates to an comprising an isolated mammalian 
TGFP-encoding nucleic acid molecule further comprising (a) a pro-TGFp polynucleotide 
encoding a mammalian pro-TGFp polypeptide, wherein the pro-TGFp polynucleotide does not 
encode a cysteine residue within the first ten anodno acid residues of the pro-TGFp polypeptide; 
and (b) a signal polynucleotide encoding a heterologous signal polypeptide, wherein the 
polynucleotide is in frame with the pro TGFp polynucleotide. 

[0017] In another aspect the invention relates to eukaryotic cells containing 
polynucleotides of the invention. 

[0018] In another aspect the invention relates to an isolated polypeptide encoded by a 
nucleic acid further comprising (a) a pro-TGFP polynucleotide encoding a mammalian pro- 
TGFp polypeptide, wherein the pro-TGFp polynucleotide does not encode a cysteine residue 
within the first ten amino acid residues of the pro-TGFp polypeptide; and (b) a signal 
polynucleotide encoding a heterologous signal polypeptide, wherein the polynucleotide is in 
frame with the pro TGFp polynucleotide. In some aspects, the isolated polypeptide has the amino 
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acid sequence of SEQ ID NO:l, SEQ ID N0:2, SEQ ID NO:3, or SEQ ID NO:4. In other 

aspects, the isolated polypeptide is fused with a heterologous peptide. 

[0019] In another aspect the invention relates to methods of producing TGPp 
polypeptides comprising culturing eukaryotic cells of the invention under conditions wherein 
greater than 25 mg of mature TGFp per Uter of culture medixun is produced, and recovering the 
TGpp polypeptide from the isolated cell line or its medium. 

[0020] In another aspect, the invention relates to a method of producing mature TGPp 
polypeptide comprising: (a) culturing a eukaryotic cell line of the invention to produce TGPP 
complex in the culture medium, wherein TGFp complex comprises mature TGpp polypeptide 
and LAP polypeptide fused with a purification tag polypeptide; (b) purifying the TGPp complex 
by binding the TGPp complex with a binding agent that specifically binds to the purification tag 
polypeptide; (c) activating the TGPp complex to dissociate mature TGPP polypeptide fi^om 
associated LAP polypeptide; (d) separating mature TGPp polypeptide firom ithe LAP 
polypeptide; and (e) recovering the TGPp polypeptide firom the isolated cell line or its medium. 

[0021] In another aspect, the invention relates to an isolated Chinese hamster ovary cell 
line comprising a pro-TGPp polynucleotide encoding a mammalian pro-TGPp polypeptide, 
wherein the polynucleotide does not encode a cysteine residue within the first ten amino acid 
residues of the pro-TGPp polypeptide. 

[0022] In another aspect, the invention relates to a method of producing mature TGPp 
polypeptide comprising culturing an isolated eukaryotic cell line comprising a recombinant pro- 
TGPp polynucleotide encoding a mammaUan pro-TGpp polypeptide, wherein the polynucleotide 
does not encode a cysteine residue within the first ten amino acid residues of the pro-TGPp 
polypeptide, and wherein the cell line is cultured under conditions that produce greater than 25 
mg of mature TGPp per liter of culture medium, and recovering the TGPp polypeptide fi-om the 
isolated cell line or its medium. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0023] Figure 1 shows a schematic representation of the latent TGPpl secreted by 
most cells. TGPpi dimer, which is linked noncovalently wifli LAP dimer, has eight intra- and 
one inter-subunit disulphide bonds. 

[0024] Figure 2 shows a schematic diagram of the expression vector that was 
employed for expression of pro-TGppi. 
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[0025] Figure 3 shows an electrophoresis and Western blot analysis of purijaed TGPp 1 . 
Panel A shows a gel electrophoresis analysis of pro-TGFpl purified by a Ni-NTA column. 
Samples were run on a homogeneous 20% SDS-polyacrylamide gel and the gel was stained with 
Coomassie blue. Lane 1 shows the results of electrophoresis under nonreducing conditions, and 
lane 2 shows the results run under reducing conditions. Three major bands showed up which 
represented pro-TGPpi, LAP and mature TGFpl, respectively. Panel B and Panel C show a gel 
electrophoresis analysis of mature TGPp after purification by size exclusion chromatography. 
Samples were run on a homogeneous 20% SDS-polyacrylamide gel. In Panel B, the gel was 
stained wifli Coomassie blue and the lanes were as follows: Lane 1, molecular weight markers 
(kDa); Lane 2, mature TGPp run under nonreducing conditions; Lane3, mature TGpp run under 
reducing conditions. In Panel C, the gel was silver stained. Lanes 1, 2 and 3 contained 2, 1.5 
and Ijxg purified TGPpi, respectively, run under non-reducing conditions; Lane 4 contained 
molecular weight markers. Panel D shows a Westem blot analysis. A 1,5 |ig sample of purified 
TGPpi was run under non-reducing conditions on a 12.5% SDS-polyacrylamide gel. Proteins in 
the gel were transferred to a PVDP mraibrane by electroblotting, and TGPpi was visualized by 
inmiunostaining. 

[0026] Figure 4 shows a size exclusion chromatography profile of TGPp 1. A 2 ml 
sample was loaded onto a HiLoad 16/60 Superdex 200 prep grade column. The equiUbration and 
elution buffer was as follows: 50mM Glycine, 50mM sodium chloride, at pH 4.0. 

[0027] Figure 5 shows a comparison between purified TGPP 1 and commercial TGPp 1 
in binding with hxunan recombinant type n receptor. Samples were made serial dilution firom 
1000 pg/ml to 3 1 .2 pg/ml. Optical density was measured at wavelength of 450mn. 

[0028] Figure 6 shows BL^core sensogram analysis of TpRII binding to TGPp 1 
immobilized on a CM5 chip using the following concratrations of TpRII: 150, 75, 37,5, 18.75, 
9.38, 4.69, 2.34, 1.17, and 0.586 nM, 

[0029] Figure 7 shows an analysis of TpRII binding to TGPp 1 . Results are shown as a 
double reciprocal plot of l/(binding RU) vs. 1 /(concentration of TpRII). 

[0030] Figure 8 shows a comparison between the biological activity of TGPpi and the 
biological activity of commercial TGppi biological activity. All assays were performed in 
tripUcate and error bars represent the standard error of the mean of the samples. 
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DETAILED DESCRIPTION 
A. General Overveew 

[0031] The present invention relates to novel compositions and methods for the high 
efficiency production of the transforming growth factor-beta (TGFP) supergene family of 
peptide growth factors. Preferred transforming growth factor-beta supergene family members are 
TGPp 1, 2, and 3 in eukaryotic cells. Unexpectedly superior results have been achieved in the 
production of mammahan TGFp polypeptides using eukaryotic host cells that have been 
transformed with a recombinant expression vector containing a gene that encodes a mammahan 
pro-TGPp polypeptide. The expression system of the invention has produced a yield of mature 
TGFp polypeptide that is at least about ten fold higher than those previously reported. 

[0032] The TGFp supergene family consists of a set of growth factors that share at least 
25% sequence identity in their mature amino acid sequence with those of TGFps, Members in 
this gene family include the transforming growth factors, TGFpl through 5; inhibins and activins 
(inhibin A, inhibin B, activin A and activin AB; bone morphogenic proteins (BMP), such as 
BMP-2; the decapentaplegic gene complex, DPP-C; Vgl; vgr-1; Miillerian inhibiting substance, 
MIS; growth differentiation factors, GDF-1 and dorsalin-1, dsl-1. Like TGFp, they exist as 
either homo or hetero dimers. See Kingsley, 1994, Genes Dev 8:133-146; Griffith, 1996, Proc 
Natl Acad. Sci U.S,A, 93:878-883; Hogan, 1996, Genes Dev, 10:1580-1594; Massague, 1996, 
Cell 85:947-950; all of which are incorporated by reference in their entirety for all purposes. 
Collectively, these TGFP supergene fandly members are also referred to herem as "TGPP" 

[0033] The five isoforms of TGpps that have been isolated to date from different 
species share a 50-80% sequence homology with a conserved Cys 33 in all five sequences. All 
expressed as disulfide bonded pro-TGFp peptides. Structurally, the mature form of TGPpi, 2, 
and 3 have been shown to be nearly identical. The high sequence and structural homology 
among the five TGFp together with their similar expression/activation pathways suggest that the 
current expression technology be fully appUcable for all five TGFps. (Marks et al, 1991, J. Mol 
BioL 222:581-597, for example.) 

[0034] The monoclonal antibodies herein specifically include "chimeric" antibodies 
(immunoglobulins) in which a portion of the heavy and/or Ught chain is identical with or 
homologous to coiresponding sequences in antibodies derived from a particular species or 
belonging to a particular antibody class or subclass, while the remainder of the chain(s) is 
identical with or homologous to corresponding sequences in antibodies derived from another 
species or belonging to another antibody class or subclass, as well as fragments of such 
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antibodies, so long as they exhibit the desired biological activity (Cabilly et al^ supra\ and 

Morrison a/., 1984, Prac, Natl Acad. ScL U.S.A. 81:6851-6855. 

[0035] 'Humanized" fonns of non-human (e.^., murine) antibodies are chimeric 

immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 

or other antigen-binding subsequences of antibodies) which contain minimal sequence derived 

from non-human immunoglobulin. For the most part, humanized antibodies are human 

immimoglobulins (recipient antibody) in which residues from a complementary-determining 

region (CDR) of the recipient are replaced by residues from a CDR of a non-human species 

(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity, and 

capacity. In some instances, Fv framework region (PR) residues of the human immunoglobulin 

are replaced by corresponding non-himian residues. Furthermore, humamzed antibodies can 

comprise residues which are foimd neither in the recipient antibody nor in the imported CDR or 

framework sequences. These modifications are made to fiulher refine and optimize antibody 

perfomiance. In general, the humamzed antibody will comprise substantially all of at least one, 

and typically two, variable domains, in which all or substantially all of the CDR regions 

correspond to those of a non-human immunoglobulin and all or substantially all of the FR 

regions are those of a human immunoglobulin sequence. The humanized antibody optimally also 

will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a 

human immunoglobulin. For fiirther details, see Jones et al, 1986, Nature 321:522-525; 

Reichmanne^a/., 1988, Mzrwre 332:323-329; Presta, 1992, Curr, Op. Struct. Biol, 2:593-596. 

The humanized antibody includes a Primatized™ antibody wherein the antigen-binding region of 

the antibody is derived from an antibody produced by immunizing macaque monkeys with the 

antigen of interest. 

[0036] **Non-immunogenic in a human" means that upon contacting the polypeptide of 
interest in a physiologically acceptable carrier and in a ther^eutically effective amount with the 
appropriate tissue of a human, no state of sensitivity or resistance to the polypeptide of interest is 
demonstrable upon the second administration of the polypeptide of interest after an appropriate 
latent p^od {e.g.^ 8 to 14 days). 

[0037] A "neutralizing antibody*' is meant an antibody which is able to block or 
significantiy reduce an effector fimction of wild type or recombinant TGFB For example, a 
neutralizing antibody can inhibit or reduce TGFB activation by an agonist antibody, as 
determined, for example, in a neurite survival assays, a TGFB binding assay, or other assays 
taught herein or known in the art. 
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[0038] Methods of producing polyclonal and monoclonal antibodies that react 

specifically with the TGFB proteins are known to those of skill in the art {see, Coligan, 

1991, Cun-ent Protocols in Immunology^ Harlow a/., supra\ Coding, 192^6, Monoclonal 

Antibodies: Principles and Practice (2nd ed.); Kohler at al, 1975, Nature 256:495-497. Such 

techniques include antibody preparation by selection of antibodies fi-om libraries of recombinant 

antibodies in phage or similar vectors, as well as preparation of polyclonal and monoclonal 

antibodies by immunizing rabbits or mice {see, e.g., Huse et aL, 1989, Science 246:1275-1281; 

Ward et al, 1989, Nature 341:544-546. 

[0039] A niimber of immunogens comprising portions of TGFB protein can be used to 
produce antibodies specifically reactive with TGFB protein. For example, recombinant TGFB 
protein or an antigenic firagment thereof, can be isolated as described herein. Recombinant 
protein can be expressed in eukaryotic or prokaryotic cells as described above, and purified as 
generally described above. Recombinant protein is the preferred immunogen for the production 
of monoclonal or polyclonal antibodies. Alternatively, a synthetic peptide derived firom the 
sequences disclosed herein and conjugated to a carrier protein can be used an immunogen. 
Naturally occurring protein can also be used either in pure or impure form. The product is then 
injected into an animal capable of producing antibodies. Either monoclonal or polyclonal 
antibodies can be generated, for subsequent use in immimoassays to measure the protein. 

[0040] Methods of production of polyclonal antibodies are known to those of skill in 
the. art. An inbred strain of ndce {e.g., BALB/C mice) or rabbits is immunized with the protein 
using a standard adjuvant, such as Freimd's adjuvant, and a standard immunization protocol. The 
animal's immune response to the immimogen preparation is monitored by taking test bleeds and 
determining the titer of reactivity to the beta subunits. When appropriately high titers of antibody 
to the immimogen are obtained, blood is collected fi-om the animal and antisera are prepared. 
Further firactionation of the antisera to enrich for antibodies reactive to the protein can be done if 
desired (see, Harlow & Lane, supra). 

[0041] Monoclonal antibodies can be obtained by various techniques familiar to those 
skilled in the art. Briefly, spleen cells fi"om an animal immunized with a desired antigen are 
immortalized, commonly by fusion with a myeloma cell {see, Kohler et al, 1976, Eur. J. 
Immunol 6:51 1-519. Alternative methods of immortalization include transformation with 
Epstein Barr Vims, oncogenes, or retroviruses, or other methods well known in the art. Colonies 
arising fi-om single iimnortalized cells are screened for production of antibodies of the desired 
specificity and aftinity for the antigen, and yield of the monoclonal antibodies produced by such 
cells can be enhanced by various techniques, including injection into the peritoneal cavity of a 
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vertebrate host. Alternatively, one can isolate DNA sequences which encode a monoclonal 
antibody or a binding fragment thereof by screening a DNA library from human B cells 
according to the general protocol outlined by Huse et aly 1989, 5c/ence,246:1275-1281. 

[0042] Monoclonal antibodies and polyclonal sera are collected and titered against the 
immunogen protein in an immunoassay, for example, a soUd phase immunoassay with the 
immunogen iimnobihzed on a solid support. Typically, polyclonal antisera with a titer of 10"* or 
greater are selected and tested for their cross reactivity against non-TGFB proteins, using a 
competitive binding immunoassay. Specific polyclonal antisera and monoclonal antibodies will 
usually bind with a Ka of at least about 0. 1 mM, more usually at least about 1 \\M, preferably at 
least about 0.1 jiM or better, and most preferably, 0.01 \iM or better. Antibodies specific only for 
a particular TGFB ortholog, such as human TGFB , can also be made, by subtracting out other 
cross-reacting orthologs from a species such as a non-human mammal. 

[0043] Once the specific antibodies against TGFB protein are available, tide protein can 
be detected by a variety of immunoassay methods. In addition, the antibody can be used 
therapeutically as TGFB modulators. For a review of immunological and immunoassay 
procedures, see Stites and Terr, eds., 1991, Basic and Clinical Immunology, 7th ed. Moreover, 
the immunoassays of the present invention can be performed in any of several configurations, 
which are reviewed extensively in Maggio, ed., 1980, Enzyme Immunoassay; and Harlow & 
Lane, supra. 

[0044] In a fiirther embodiment, antibodies or antibody fragments can be isolated from 
antibody phage Ubraries generated using the techniques described in McCafferty et al, 1990, 
iVa/i/re 348:552-554; Clackson a/., 1991, iViz^re 352:624-628; Marks a/., \99\, J. Mol 
Biol 222:581-597, describe the isolation of mmine and human antibodies, respectively, using 
phage libraries. Subsequent publications describe the production of high affinity (nM range) 
human antibodies by chain shuffling (Mark et al, 1992, Bio/Technology 10:779-783, as weU as 
combinatorial infection and in vivo recombination as a strategy for constructing very large phage 
Ubraries (Waterhouse et al, 1993, Nuc. Acids. Res. 21:2265-2266. Thus, these techniques are 
viable alternatives to traditional monoclonal antibody hybridoma techniques for isolation of 
monoclonal antibodies. 

[0045] The DNA also can be modified, for example, by substitutiBg the coding 
sequence for human heavy- and light-chain constant domains in place of the homologous murine 
sequences (Cabilly et al, supra; Morrison, et al. 1984, Proc. Nat Acad. Set U.SA. 81:6851; or 
by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence 
for a non-immunoglobulin polypeptide. 
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[0046] Typically such non-inununoglobulin polypeptides are substituted for the 

constant domains of an antibody, or they are substituted for the variable domains of one antigen- 
combining site of an antibody to create a chimeric bivalent antibody comprising one antigen- 
combining site having specificity for an antigen and another antigen-combining site having 
specificity for a different antigen. 

[0047] Chimeric or hybrid antibodies also can be prepared in vitro using known 
methods in synthetic protein chemistry, including those involving crosslinking agents. For 
example, iamiunotoxins can be constructed usiag a disulfide-exchange reaction or by forming a 
thioether bond. Examples of svutable reagents for this purpose include iminothiolate and methyl- 
4-mercaptobutyriniidate. 

[0048] Methods for humanizing non-human antibodies are well known in the art. 
Generally, a hiunanized antibody has one or more amino acid residues introduced into it firom a 
source which is nonhuman. These non-human amino acid residues are often referred to as 
"import" residues, which are typically taken fi-om an "import" variable domain. Humanization 
can be essentially performed followiag the method of Winter and co-workers (Jones et al, 1986, 
Nature 321:522-525\ Riechmann e/a/., 1988, iVatwre 332:323-322; Verhoeyen a/., 1988, 
Science 239:1534-1536), by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a hxmian antibody. Accordingly, such 'liumanized" antibodies are chimeric 
antibodies (Cabilly et aL, supra\ wherein substantially less than an intact human variable 
domain has been substituted by the corresponding sequence fi^om a non-human species. In 
practice, humanized antibodies are typically human antibodies in which some CDR residues and 
possibly some FR residues are substituted by residues from analogous sites in rodent antibodies. 

[0049] The choice of human variable domains, both light and heavy, to be used in 
making the humanized antibodies is very important to reduce antigenicity. According to the so- 
called ^^best-fif ' method, the sequence of the variable domain of a rodent antibody is screened 
against the entire library of known human variable-domain sequences. The human sequence 
which is closest to that of the rodent is then accepted as the human framework (FR) for the 
humanized antibody (Sims et al, 1993, J. Immunol 151:2296; Chothia et al, 1987, J. Mol Biol 
196:901. Another method uses a particular framework derived from the consensus sequence of 
all human antibodies of a particular subgroup of Ught or heavy chains. The same framework can 
be used for several dififerent humanized antibodies (Carter et al, 1992, Proc. Natl Acad. Set 
aS.A, 89:4285; Prestae^a/., 1993,7. Immnol 151:2623. 

[0050] It is fiirther important that antibodies be humanized with retention of high 
afSnity for the antigen and other favorable biological properties. To achieve this goal, according 
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to a preferred method, humanized antibodies are prepared by a process of analysis of the parental 
sequences and various conceptual humanized products using three-dimensional models of the 
parental and humanized sequences. Three-dimensional immunoglobulin models are commonly 
available and are familiar to those skilled in the art. Computer programs are available which 
illustrate and display probable three-dimensional conformational structures of selected candidate 
inmiunoglobulin sequences. Inspection of these displays pemiits analysis of the likely role of the 
residues in the functioning of the candidate immxmoglobuUn sequence, i.e., the analysis of 
residues that influence the ability of the candidate immunoglobulin to bind its antigen. In this 
way, FR residues can be selected and combined from the consensus and import sequences so that 
the desired antibody characteristic, such as iacreased affinity for the target antigen(s), is 
achieved. In general, the CDR residues are directly and most substantially involved in 
influencing antigen binding. 

[0051] Alternatively, it is now possible to produce transgenic animals (e.g, , mice) that 
are capable, upon immunization, of producing a full repertofre of human antibodies in the 
absence of endogenous hmnunoglobuUn production. For example, it has been described that the 
homozygous deletion of the antibody heavy-chain joining region (JH) gene in chimeric and 
germ-line mutant mice results in complete inhibition of endogenous antibody production. 
Transfer of the human germ-line inmiunoglobulin gene array in such germ-line mutant mice will 
result in the production of human antibodies upon antigen challenge. See, e.g., Jakobovits et aL, 
1993, Proc. Natl. Acad. ScL U.S.A. 90:2551; Jakobovits a/., 1993, iVa^re 362:255-258; 
Bmggennann et aL, 1993, Year in Immuno 7:33. Human antibodies can also be produced in 
phage-display Ubraries (Hoogenboom et aL, 1991, J. Mol Biol 227:381; and Marks et al, 1991, 
J. Mol Biol, 222:581). 

[0052] Bispecific antibodies (BsAbs) are antibodies that have binding specificities for 
at least two different antigens. BsAbs can be used as tumor targeting or imaging agents and can 
be used to target enzymes or toxins to a cell possessing the TGFB. Such antibodies can be 
derived from fiill length antibodies or antibody fragments {e.g., F(ab')2 bispecific antibodies). 

[0053] Methods for making bispecific antibodies are known in the art. Traditional 
production of fiill length bispecific antibodies is based on the coexpression of two 
immunoglobulin heavy chain-light chain pairs, where the two chains have different specificities 
(Millstein et al, 1983, Nature 305:537-539). Because of the random assortment of 
immunoglobulin heavy and hght chains, these hybridomas (quadromas) produce a potential 
mixture of 10 different antibody molecules, of which only one has the correct bispecific 
structure. Purification of the correct molecule, which is usually done by affinity chromatography 
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Steps, is rather cumbersome, and the product yields are low. Similar procedures are disclosed in 

WO 93/08829, pubUshed May 13, 1993, and in Traunecker et al, 1991, EMBOJ, 10:3655-3659. 

[0054] According to a different and more preferred approach, antibody variable 

domains with the desired binding specificities (antibody-antigen combining sites) are fused to 

immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin 

heavy chain constant domain, comprising at least part of the hinge, CH2, and CHS regions. It is 

preferred to have the first heavy-chain constant region (CHI) containing the site necessary for 

Ught chain binding, present in at least one of the fusions. DNAs encoding the immunoglobuUn 

heavy chain fusions and, if desired, the immunoglobulin Ught chain, are inserted into separate 

expression vectors, and are co-transfected into a suitable host organism. This provides for great 

flexibility in adjusting the mutual proportions of the three polypeptide fragments in embodiments 

when unequal ratios of the three polypeptide chains used in the construction provide the 

optimum yields. It is, however, possible to insert the coding sequences for two or all three 

polypeptide chains in one e>q)ression vector when the expression of at least two polypeptide 

chains in equal ratios results in high yields or when the ratios are of no particular significance. 

[0055] In a preferred embodiment of this approach, the bispecific antibodies are 

composed of a hybrid immimoglobulin heavy chain with a first binding specificity in one arm, 

and a hybrid immimoglobulin heavy chain-light chain pair (providing a second binding 

specificity) in the other arm. It was found that this asymmetric stracture faciUtates the separation 

of the desired bispecific compound from unwanted itnmimoglobulin chain combinations, as the 

presence of an immunoglobulm light chain in only one half of the bispecific molecule provides 

for a facile way of separation. This approach is disclosed in WO 94/04690 published Mar. 3, 

1994. For further details of generating bispecific antibodies see, for example, Suresh et al^ 1986, 

Methods in Enzymology 121:210. 

[0056] Bispecific antibodies include cross-linked or "heteroconjugate" antibodies. For 

example, one of the antibodies in the heteroconjugate can be coupled to avidin, the other to 

biotin. Such antibodies have, for example, been proposed to target inmiune system cells to 

unwanted ceUs (U.S. Pat No. 4,676,980), and for treatment of HIV infection (WO 91/00360, 

WO 92/200373, and EP 03089). Heteroconjugate antibodies can be made using any convenient 

cross-linking methods. Suitable crossUnking agents are well known in the art, and are disclosed 

in U.S. Pat. No. 4,676,980, along with a number of cross-linking techniques. 

[0057] Techniques for generating bispecific antibodies firom antibody firagments have 

also been described in the literature. The following techniques can also be used for the 

production of bivalent antibody firagments which are not necessarily bispecific. According to 
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these techniques, Fab'-SH fragments can be recovered from E. coli, which can be chemically 
coupled to form bivalent antibodies. Shalaby et al, 1992, 7. Exp, Med. 175:217-225 describe the 
production of a fiilly humanized BsAb F(ab')2 molecule. Each Fab' fragment was separately 
secreted from E, coli and subjected to directed chemical coupling in vitro to fonn the BsAb. The 
BsAb thus formed was able to bind to cells overexpressing the HER2 receptor and normal 
human T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against 
human breast tumor targets. See also Rodriguez et al, 1992, Int, J. Cancers (Suppl.) 7:45-50. 

[0058] Various techniques for making and isolating bivalent antibody fragments 
directly from recombinant cell culture have also been described. For example, bivalent 
heterodimers have been produced using leucine zippers. Kostehiy et al, 1992, J. Immunol 
148:1547-1553. The leucine zipper peptides from the Fos and Jiin proteins were linked to the 
Fab' portions of two different antibodies by gene fusion. The antibody homodimers were 
reduced at the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimers. The "diabody" technology described by HoUinger et al, 1993, Proc. Natl Acad, 
Set U.S. A. 90:6444-6448, has provided an altemative mechanism for maldng BsAb fragments. 
The fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable 
domain (Vl) by a linker which is too short to allow pairing between the two domains on the 
same chain. Accordingly, the Vh and Vl domains of one fragment are forced to pair with the 
complementary Vh and Vl domains of another fragment, thereby forming two antigen-binding 
sites. Another strategy for making BsAb fragments by the use of single-chain Fv (sFv) dimers 
has also been reported. See Gmber et al, 1994, J. Immunol 152:5368. 

[0059] In one aspect, the antibody is conjugated to an "effector*' moiety. The effector 
moiety can be any nximber of molecules, including labeling moieties such as radioactive labels or 
fluorescent labels, or can be a therapeutic moiety. In one aspect the antibody modulates the 
activity of the protein. 

[0060] "Increased or enhanced expression or activity of a polypeptide of the present 
invention," or "increased or enhanced expression or activity of a polynucleotide encoding a 
polypeptide of the present invention," refers to an augmented change in activity of the 
polypeptide or protein. Examples of such increased activity or expression include the following: 
Activity of the protein or expression of the gene encoding the protein is increased above the level 
of that in wild-type, non-transgenic controls. Activity of the protein or expression of the gene 
encoding the protein is in an organ, tissue or cell where it is not normally detected in wild-type, 
non-transgenic controls (i.e., spatial distribution of the protein or expression of the gene 
encoding the protein is altered). Activity of the protein or expression of the gene encoding the 
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protein is increased when activity of the protein or expression of the gene encoding the protein is 
present in an organ, tissue or cell for a longer period than in a wild-type, non-transgenic controls 
(i.e., duration of activity of the protein or expression of the gene encoding the protein is 
increased). 

[0061] *T)ecreased expression or activity of a protein or polypeptide of the present 
invention/' or "decreased expression or activity of a nucleic acid or polynucleotide encoding a 
protein of the present invention," refers to a decrease in activity of the protein. Examples of such 
decreased activity or expression include the following: Activity of the protein or expression of 
the gene encoding the protein is decreased below the level of that in wild-type, non-transgenic 
controls. 

[0062] The term "sequence identity*' refers to a measxire of similarity between amino 
acid or nucleotide sequences, and can be measmed using methods known in the art, such as those 
described in further detail below. 

[0063] The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are 
the same or have a specified percentage of amino acid residues or nucleotides that are the same 
(z.e, 60% identity, preferably 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity over a specified 
region (see, e.g., SEQ ID NO:l), when compared and aligned for maximum correspondence over 
a comparison window, or designated region as measured using one of the following sequence 
comparison algorithms or by manual alignment and visual inspection. 

[0064] The phrase "substantially identical," in the context of two nucleic acids or 
polypeptides, refers to two or more sequences or subsequences that have at least of at least 60%, 
often at least 70%, preferably at least 80%, most preferably at least 90% or at least 95% 
nucleotide or amino acid residue identity, when compared and aligned for maximum 
correspondence, as measured using one of the sequence comparison algorithms discussed in 
detail below or by visual inspection. Preferably, the substantial identity exists over a region of 
the sequences that is at least about 50 bases or residues in length, more preferably over a region 
of at least about 100 bases or residues, and most preferably the sequences are substantially 
identical over at least about 150 bases or residues, hi a most preferred embodiment, the 
sequences are substantially identical over the entire length of the coding regions. 

[0065] The phrase "sequence similarity" in the context of two nucleic acids or 
polypeptides, refers to two or more sequences that are identical or in the case of amino acids, 
have homologous amino acid substitutions at either 50%, often at least 60%, often at least 70%, 
preferably at least 80%, most preferably at least 90% or at least 95% of the indicated positions. 
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"(0066] Control sequences" or **regulatory sequences" refers to DNA sequences 

necessary for the expression of an operably linked coding sequence in a particular host organism. 

The control sequences that are suitable for prokaryotes, for example, include a promoter, 

optionally an operator sequence, a ribosome binding site, and possibly, other as yet poorly 

understood sequences. Eukaryotic cells are knoAvn to utilize promoters, polyadenylation signals, 

and enhancers. 

[0067] "Vector" refers to a nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a 
circular double stranded DNA loop into which additional DNA segments can be ligated. Another 
type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous repUcation in a host cell into which they are 
introduced (e.g.^ bacterial vectors having a bacterial origin of replication and episomal 
mammaUan vectors). Other vectors (e.g., non-episomal mammalian vectors) can be integrated 
into the genome of a host cell upon introduction into the host cell, and thereby are replicated 
along with the host genome. Moreover, certain vectors are capable of directing the expression of 
genes to which they are operatively linked. Such vectors are referred to herein as "recombinant 
expression vectors" (or simply, "expression vectors"). Li general, expression vectors of utiUty in 
recombinant DNA techniques are often in the form of plasmids. In the present specification, 
"plasmid" and "vector" can be used interchangeably as the plasmid is the most commonly used 
form of vector. However, the invention is intended to include such otiier forms of expression 
vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
associated viruses), which serve equivalent functions. . 

[0068] The term "promoter" refers to a region or sequence determinants located 
upstream or downstream firom the start of transcription and which are involved in recognition 
and binding of RNA polymerase and other proteins to initiate transcription. 

[0069] "Cell", "cell line", and "cell culture" are used interchangeably and all such 
designations include progeny. Thus, the words *transfonnants" and "transformed cells" include 
the primary subject cell and cultures derived therejfrom without regard for the number of 
transfers. It is also understood that all progeny cannot be precisely identical in DNA content, due 
to dehberate or inadvertent mutations. Mutant progeny that have the same function or biological 
activity as screened for in the originally transformed cell are included. Where distinct 
designations are intended, it will be clear from the context. 

[0070] The term "recombinant host cell" (or simply '*host cell") refers to a cell into 
which a recombinant expression vector has been introduced. It should be understood that such 
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terms are intended to refer not only to the particular subject cell but to the progeny of such a cell. 

Because certain modifications can occur in succeeding generations due to either mutation or 

environmental influences, such progeny can not, in fact, be identical to the parent cell, but are 

still included within the scope of the terai "host cell" as used herein. 

[0071] An "expression cassette" refers to a nucleic acid constmct, wliich when 
introduced into a host cell, results in transcription and/or translation of a RNA or polypeptide, 
respectively. Expression cassettes can be derived from a variety of sources depending on the host 
cell to be used for expression. For example, an expression cassette can contain components 
derived from a viral, bacterial, insect, or mammalian source. In the case of both expression of 
transgenes and inhibition of endogenous genes (e.g^., by antisense, or sense suppression) one of 
skill will recognize that the inserted polynucleotide sequence need not be identical and can be 
"substantially identical" to a sequence of the gene from which it was derived As explained 
below, these variants are specifically covered by this term. 

[0072] In the case where the inserted polynucleotide sequence is transcribed and 
translated to produce a fimctional polypq)tide, one of skill will recognize that because of codon 
degeneracy a number of polynucleotide sequences will encode the same polypeptide. These 
variants are specifically covered by the term "polynucleotide sequence firom" a particular gene. 
In addition, the term specifically includes sequences (e.g., full length sequences) substantially 
identical with a gene sequence encoding a polypeptide of the present invention, e.g., SEQ ID 
NO: 1, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4, and that encode proteins that retain the 
fimction of a protein of the present invention, e.g, , specific binding to their respective TGFp 
supergene family members. 

[0073] In the case of polynucleotides used to inhibit expression of an endogenous gene, 
the introduced sequence need not be perfectly identical to a sequence of the target endogenous 
gene. The introduced polynucleotide sequence will typically be at least substantially identical to 
the target endogenous sequence, 

[0074] "Receptor*' denotes a cell-associated protein, for example a TGFp receptor 
(TGPP-R; type I or -type n, both of which are serine/threonine kinases) on the cell surface, that 
binds to a bioactive molecule teraied a "ligand," for example, a TGFp dimer. This interaction 
mediates the effect of the ligand on the cell. Receptors can be membrane bound, cytosoUc or 
nuclear; monomeric (e.g., thyroid stimulating hormone receptor, beta-adrenergic receptor) or 
multimeric. Membrane-bound recq)tors, for example TGFp-R, are characterized by a multi- 
domain structure comprising an extracellular ligand-binding domain and an intracellular efTector 
domain that is typically involved in signal transduction. In certain membrane-botmd receptors, 
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the extracellular ligand-binding domain and the mtraceUular effector domain are located in 
separate polj^eptides that comprise the complete functional receptor. 

[0075] In general, the binding of ligand to receptor results in a conformational change 
in the receptor that causes an interaction between the effector domain and other molecule(s) in 
the cell, which in tum leads to an alteration in the metaboUsm of the cell. Metabolic events that 
are often linked to receptor-ligand interactions include gene transcription, phosphorylation, 
dephosphorylation, increases in cychc AMP production, mobilization of cellular calcium, 
mobilization of membrane Upids, cell adhesion, hydrolysis of inositol Upids and hydrolysis of 
phosphoUpids. 

[0076] 'Tffigh afiSnity" for a ligand refers to an equiUbrium association constant (Ka) of 
at least about lO^M'^ at least about lO'^M"^ at least about 10^\ at least about 10^M"\ at least 
about 10^]Vr\ at least about 10^M'\ at least about 10^^ at least about 10^^'\ at least about 
10"M"\ or at least about lO^^M"* or greater, e.g., up to 10^^^ or 10^'*M'^ or greater. However, 
"high afiBnity" binding can vary for other ligands. 

[0077] "Ka", as used herein, is intended to refer to the equilibrium association constant 
of a particular Ugand-receptor interaction, e.g., antibody-antigen interaction. This constant has 
units of 1/M. 

[00781 as used herein, is intended to refer to the equilibrium dissociation 

constant of a particular Ugand-receptor interaction. This constant has units of M. 

[00791 "ka", as used herein, is intended to refer to the kinetic association constant of a 
particular ligand-receptor interaction. This constant has units of 1/Ms. 

[0080] "kd", as used herein, is intended to refer to the kinetic dissociation constant of a 
particular ligand-receptor interaction. This constant has vmits of 1/s. 

[0081] The phrase "specifically (or selectively) binds" to an antibody refers to a 
binding reaction that is determinative of the presence of the protein in a heterogeneous 
population of proteins and other biologies. Thxis, under designated immxmoassay conditions, the 
specified antibodies bind to a particular protein at least two times the background and do not 
substantially bind in a significant amount to other proteins present in the sample. 

[0082] The phrase "specifically bmd(s)" or ^^bindCs) specifically^' when referring to a 
peptide refers to a peptide molecule which has interaiediate or high binding affinity, exclxisively 
or predominately, to a target molecule. The phrase "specifically binds to" refers to a binding 
reaction which is determinative of the presence of a target protein in the presence of a 
heterogeneous population of proteins and other biologies. Thus, imder designated assay 
conditions, the specified binding moieties bind preferentially to a particular target protem and do 

-18- 



wo 2005/068499 PCTAJS2005/000378 
not bind in a significant amount to other components present in a test sample. Specific binding to 
a target protein under such conditions can require a binding moiety that is selected for its 
specificity for a particular target antigen. A variety of assay foraiats can be used to select ligands 
that are specifically reactive with a particular protein. For example, solid-phase ELIS A 
immunoassays, immunoprecipitation, Biacore and Western blot are used to identify peptides that 
specifically react with TGFp domain-containing proteins. Typically a specific or selective 
reaction will be at least twice background signal or noise and more typically more than 10 times 
background. Specific binding between TGFp and TGpp-R means a binding affinity of at least 
10' M'\ and preferably 10^ 10^ 10^ 10^ 10^ or 10^^ M'^ The binding afiEbiity of TGFp and 
TGFp-R is preferably between about 10^ M"^ to about 10*° MT^ 

[0083] "Tarticular hgand-receptor interactions" refers to the experimental conditions 
under which the equihbrium and kinetic constants are measured. 

[0084] "Pharmaceutically acceptable excipient "means an excipient that is useful in 
preparing a pharmaceutical composition that is generally safe, non-toxic, and desirable, and 
includes excipients that are acceptable for veterinary use as well as for human pharmaceutical 
use. Such excipients can be solid, liquid, semisoUd, or, in the case of an aerosol composition, 
gaseous. 

[0085] '"Pharmaceutically acceptable salts and esters" means salts and esters that are 
pharmaceutically acceptable and have the desired pharmacological properties. Such salts include 
salts that can be formed where acidic protons present in the compounds are capable of reacting 
with inorganic or organic bases. Suitable inorganic salts include those formed with the alkali 
metals, e.g,, sodium and potassium, magnesium, calcimn, and aluminum. Suitable organic salts 
include those formed with organic bases such as the amine bases, e.g., ethanolamine, 
diethanolamine, triethanolamine, tromethamine, N methylglucamine, and the Uke. Such salts 
also mclude acid addition salts formed with inorganic acids {e.g., hydrochloric and hydrobromic 
acids) and organic acids (e.g, acetic acid, citric acid, maleic acid, and the alkane- and arene- 
sulfonic acids such as methanesulfonic acid and benzenesulfonic acid). Phamiaceutically 
acceptable esters include estCTs formed from carboxy, sulfonyloxy, and phosphonoxy ^oups 
present in the compounds, e.g., alkyl esters. When there are two acidic groups present, a 
pharmaceutically acceptable salt or ester can be a mono-acid-mono-salt or ester or a di-salt or 
ester; and similarly where there are more than two acidic groups present, some or all of such 
groups can be saUfied or esterified. Compounds named in this invention can be present in 
unsalified or unesterified fomi, or in saUfied and/or esterified form, and the naming of such 
compounds is intended to include both the original (unsalified and unesterified) compound and 
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its phannaceutically acceptable salts and esters. Also, certain compounds named in this 

invention may be present in more than one stereoisomeric form, and the naming of such 

compoimds is intended to include all single stereoisomers and all mixtures (whether racemic or 

otherwise) of such stereoisomers. 

[0086] The terms "phannaceutically acceptable", "physiologically tolerable" and 
grammatical variations thereof, as tliey refer to compositions, carriers, diluents and reagents, are 
used interchangeably and represent diat the materials are capable of administration to or upon a 
human without the production of undesirable physiological effects to a degree that would 
prohibit administration of the composition, 

[0087] A "therapeutically effective amounf means the amount that, when administered 
to a subject for treatmg a disease, is sufficient to effect treatment for that disease. 

[0088] Except when noted, the terms "subject" or "patient" are used interchangeably 
and refer to mammals such as human patients and non-human primates, as well as experimental 
animals such as rabbits, rats, and mice, and other animals. Accordingly, the term "subject" or 
"patient" as used herein means any mammalian patient or subject to which the compositions of 
the invention can be administered. 

[0089] "Concomitant administration" of a known cancer therapeutic dmg with a 
pharmaceutical composition of the present invention means administration of the drug and the 
coUectin and/or surfactant protein composition at such time that both the known drug and the 
composition of the present invention will have a therapeutic effect. Such concomitant 
administration may involve concurrent (le., at the same time), prior, or subsequent 
administration of the antiimcrobial dmg with respect to the administration of a compound of the 
present invention. A person of ordinary skill in the art would have no difficulty determining the 
^propriate timing, sequence and dosages of administration for particular dmgs and 
compositions of the present invention. 

[0090] ^Treating" refers to any indicia of success in the treatment or amehoration of an 
injury, pathology or condition, including any objective or subjective parameter such as 
abatement; remission; diminishing of symptoms or making the injury, pathology, or condition 
more tolerable to the patient; slowing in the rate of degeneration or decline; making the final 
point of degeneration less debihtating; or improving a subject's physical or mental well-being. 
The treatment or amehoration of symptoms can be based on objective or subjective parameters; 
including the results of a physical examination. Accordingly, the term 'treating" includes the 
administration of the compoimds or agents of the present invention to inhibit a disease. 
Accordmgly, the term "treating'* includes the administration of the compounds or agents of the 
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present invention to prevent or delay, to alleviate, or to arrest or inhibit development of the 

symptoms or conditions associated with a disease or other disorder. The term "therapeutic 

effect" refers to the reduction, elimination, or prevention of the disease, symptoms of the disease, 

or side effects of the disease in the subject. 

[00911 Additional fusion proteins of the invention cau be generated through the 
techniques of gene-shuffling, motif-shuffling, exon-shufOing, or codon-shuffling (collectively 
referred to as 'T)NA shuffling''). DNA shuffling can be employed to modulate the activities of 
polypeptides of the present invention thereby effectively generating agonists and antagonists of 
the polypeptides. See, for example, U.S. Pat. Nos. 5,605,793; 5,811,238; 5,834,252; 5,837,458; 
Patten et aL, 1997, Curr. Opinion Biotechnol 8:724-733; Harayama, 1998, Trends BiotechnoL 
16:76-82; Hansson et al, 1999, J, Mol Biol 287:265-276; Lorenzo et al, 1998, Biotechniques 
24:308-313. (Each of these documents is hereby incorporated by reference). In one embodiment, 
one or more components, motifs, sections, parts, domains, fragments, and the like, of coding 
polynucleotides of the invention, or the polypeptides encoded thereby can be recombined with 
one or more components, motifs, sections, parts, domains, fragments, and the like, of one or 
more heterologous molecules. 

[0092] The term "epitope-tagged" when used herein refers to a chimeric polypeptide 
comprising TGFp fused to a "tag polypeptide". The tag polypeptide has enough residues to 
provide an epitope against which an aatibody can be made, yet is short enough such that it does 
not interfere with biological activity of the TGpp . The tag polypeptide preferably also is fairly 
imique so that the antibody does not substantially cross-react with other epitopes. Suitable tag 
polypeptides generally have at least six amino acid residues and usually between about 8-50 
amino acid residues (preferably between about 9-30 residues). Preferred are poly-histidine 
sequences, which bind nickel, allowing isolation of the tagged protein by Ni-NTA 
chromatography as described for example in Lmdsay, et al Neuron 17:571-574, 1996. 

[0093] The nucleic acids of the invention are present in whole cells, in a cell lysate, or 
in a partially purified or substantially pure form. A nucleic acid is "isolated" when purified away 
from other cellular components or other contaminants, e,g,^ other cellular nucleic acids or 
proteins, by standard techniques, including alkaline/SDS treatment, CsCl banding, column 
chromatography, agarose gel electrophoresis and others well known in the art {See, e.g., 
Sambrook, Tijssen and Ausubel discussed herein and incorporated by reference for all purposes). 
The nucleic acid sequences of the invention and other nucleic acids used to practice this 
invention, whether KNA, cDNA, genomic DNA, or hybrids thereof, can be isolated from a 
variety of sources, genetically engineered, amplified, and/or expressed recombinantly. Any 
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recombinant expression system can be used, including, in addition to bacterial, e.g., yeast, insect 
or mammalian systems. Alternatively, these nucleic acids can be chemically synthesized in vitro. 
Techniques for the manipulation of nucleic acids, such as, e.g., subcloning into expression 
vectors, labeling probes, sequencing, and hybridization are weU described in the scientific and 
patent Uterature, see, e.g., Sambrook, Tijssen and Ausubel. Nucleic acids can be analyzed and 
quantified by any of a number of general means well known to those of skill in the art. These 
include, e.g., analytical biochemical methods such as NMR, spectrophotometry, radiography, 
electrophoresis, capillary electrophoresis, high performance liquid chromatography (HPLC), thin 
layer chromatography (TLC), and hyperdiffusion chromatography, various immunological 
methods, such as fluid or gel precipitin reactions, immunodiffusion (single or double), 
Immunoelectrophoresis, radioimmunoassays CRIAs), enzyme-linked immunosorbent assays 
(ELISAs), immuno-fluorescent assays, Soufliem analysis. Northern analj^s, dot-blot analysis, 
gel electrophoresis (e.g., SDS-PAGE), RT-PCR, quantitative PGR, other nucleic acid or target or 
signal amplification methods, radiolabeling, scintillation counting, and afBnity chromatography. 

[0094] "Essentially pure" protein means a composition comprising at least about 90% 
by weight of the protein, based on total weight of the composition, preferably at least about 95% 
by weight. ''Essentially homogeneous" protein means a composition comprising at least about 
99% by weight of protein, based on total weight of the composition. 

[0095] "lohibitors," "activators," and "modulators" of TGFp activity are used to refer 
to inhibitory, activating, or modulating molecules, respectively, identified using in vitro and in 
vivo assays for TGFp binding or signaling, e.g., ligands, agonists, antagonists, and tiieir 
homologs and mimetics. 

[0096] The term "modulator" includes inhibitors and activators. Inhibitors are agents 
that, e.g., bind to, partially or totally block stimulation, decrease, prevent, delay activation, 
inactivate, desensitize, or down regulate the activity of TOF^s, e.g., antagonists. Activators are 
agents that, e.g., bind to, stimulate, increase, open, activate, facilitate, enhance activation, 
sensitize or up regulate the activity of TGFps, e.g., agonists. Modulators include agents that, e.g., 
alter the interaction of TGPPs with: proteins that bind activators or inhibitors, receptors, 
including proteins, peptides, lipids, carbohydrates, polysaccharides, or combinations of the 
above, e.g.. lipoproteins, glycoproteins, and the like. Modulators include genetically modified 
versions of naturally-occurring TGFp ligands, e.g., with altered activity, as well as naturally 
occurring and synthetic ligands, antagonists, agonists, small chemical molecules and the like. 
Such assays for inhibitors and activators include, e.g., ^plying putative modulator compounds to 
a cell expressing a TGFp and then determining the functional effects on TGFp signaling. 
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Samples or assays comprising TGFp that are treated with a potential activator, inhibitor, or 
modulator are compared to control samples without the inhibitor, activator, or modulator to 
examine the extent of inhibition. Control samples (untreated with inhibitors) can be assigned a 
relative TGFp activity value of 100%. Inhibition of TGFp is achieved, for example, when the 
TGFP activity value relative to the control is about 80%, optionally 50% or 25-0%. Activation of 
TGFP is achieved when the TGFp activity value relative to the control is 1 10%, optionally 
150%, optionaUy 200-500%, or 1000-3000% higher. 

[0097] This invention relies on routine techniques in the field of recombinant genetics. 
Basic texts disclosing the general methods of use in this invention include Sambrook et al. 
Molecular Cloning. A Laboratory Manttal, 2nd ed., 1989; Kriegler, Gene Transfer and 
Expression: A Laboratory Manual, 1990; and Ausubel et at., eds., Cun-ent Protocols in 
Molecular Biology. 1994, all of which are incorporated by reference in their entireties for all 
purposes. 

B. PRO-TGFP POLYNUCLEOTTOES 

[0098] As used herein, a "pro-TGPp polynucleotide" refers to a polynucleotide that 
encodes a mammaUan pro-TGFp polypeptide. Polynucleotides of the invention can be derived 
from natural sources or they can be synthetically produced. The pro-TGFp polynucleotide can 
encode any mammaUan pro-TGFpi, pro-TGFp2, or pro-TGFp3 polypeptide. A mammaUan pro- 
TGFp polypeptide typically contains a latency associated peptide (LAP) portion and a mature 
TGFp portion. Preferred pro-TGFp polynucleotides of the invention are those that encode a pro- 
TGFp having a mature TGFp portion that is at least 90% identical to a mature mammaUan TGFp 
polypeptide, and more preferably, at least 95% identical to a mature mammaUan TGFp 
polypeptide. 

[0099] The pro-TGFP polypeptide sequence, and the associated nucleic acid sequence, 
for any particular species can be readily ascertamed by techniques that are weU known in the art 
The polypeptide sequences for several mammalian TGFP polypeptides, both the mature 
polypeptide and the LAP polypeptide, can be found at the National Center for Biotechnology 
Information (NCBI) web site for the following organisms: TGFP 1 -human (NCBI Accession No: 
POl 137); TGFpl-sheep (NCBI Accession No: P50414); TGFpi-rat (NCBI Accession No: 
P17246); TGFpi-pig (NCBI Accession No: P07200); TGFpl-bovine (NCBI Accession No: 
P18341); TGFp 1 -domestic guinea pig (NCBI Accession No: Q9Z1T6); TGFpl-horse (NCBI 
Accession No: O1901 1); TGFpl-dog (NCBI Accession No: P54831); TGFpi-mouse (NCBI 
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Accession No: P04202); TGFpi-Aflricaii Green Monkey (NCBI Accession No: P09533); 

TGFp2-human (NCBI Accession No: P081 12); TGFp2-rat (NCBI Accession No: NP 1 12393); 
TGFpS-human (NCBI Accession No: P10600). 

[0100] Preferred pro-TGPP polynucleotides of the invention are those that encode a 
polypeptide having a mature TGPp portion that is at least 90% identical to mature human TGPpi 
(SEQ ID NO:l; residues 279-390), mature human TGFp2 (SEQ ID NO:3; residues 303-404), 
and mature human TGFp3 (SEQ ID NO:4; residues 301-402), and preferably at least 95% 
identical to a mature human TGPpi, 2, or 3, and more preferably at least 98% identical to mature 
human TGFpl, 2, or 3. 

[0101] Preferred pro-TGPP polynucleotides of the invention are those tliat encode a 
pro-TGFp polypeptide having a LAP portion that this at least 90% percent identical to a the LAP 
portion of a mammaUan TGFp polypeptide, and more preferably, at least 95% identical to the 
LAP portion of a mammalian TGFp polypeptide. Particularly preferred TGFp polynucleotides 
of the invention are those that encode a polypeptide haviag a LAP portion that is at least 90% 
identical to the LAP portion of human TGFpl (SEQ ID NO:l; residues 30-278), the LAP portion 
of hunaan TGFp2 (SEQ ID NO:2; residues 20-302), the LAP portion of human TGPps (SEQ ID 
NO:4; residues 21-300), or the LAP portion of porcine TGFpi (SEQ ID NO:2; residues 30-278), 
and more preferably, at least 95% identical to the LAP portion of human TGFpi, human TGFp2, 
human TGFp3, or porcine TGPpi. 

[0102] Preferred pro-TGPp polynucleotides of the invention are those that encode a 
mammalian pro-TGFp polypeptide having a LAP portion that does not contain a cysteine residue 
within the first 10 amino acids. More preferred TGPP polynucleotides of the invraition are those 
that encode a polypeptide having a LAP portion as follows: (i) a LAP portion of human TGppi 
(SEQ ID NO:l; residues 30-278), wherein the cysteine at residue 33 has been substituted by 
another amino acid; (ii) a LAP portion of human TGFp2 (SEQ ID NO:2; residues 20-302), 
wherein the cysteine at residue 24 has been substituted by another amino acid; (iii) a LAP 
portion of human TGFp3 (SEQ ID NO:4; residues 21-300), wherein the cysteine at residue 27 
has been substituted by another amino acid; (iv) a LAP portion of porcine TGPpi (SEQ ID 
NO:2; residues 30-278), wherein the cysteine at residue 33 has been substituted by another 
amino 

[0103] In a preferred embodiment, the indicated c>^eine residue in the LAP 
polypeptide sequences discussed above is r^laced with an amino acid selected fix)m the group 
consisting of: Ala, Gly, Thr, Asp, Asn, Glu, Ghi, Val, Tyr, He, Leu, Met, Lys, His, Tip, Phe, 
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Arg, Pro. In a particularly preferred embodiment of the invention, the indicated cysteine residue 
in the LAP polypeptide sequences discussed above is replaced with a serine residue. 

[0104] TGFpi Precursor-human (NCBI Accession No: POl 137) (SEQ ID NO:l) (the 
LAP portion is shown underlined): 

1 mppsglrlll lllpllwllv Itpgrpaagl stcktidmel vkrkrieair ggilsklrla 
61 sppsqqevpp gplpeavlal ynstrdrvaq esaepepepe adyyakevtr vlmvethnei 
121 ydkfkqsths iymffntsel reavpepvll sraelrllrl klkveqhvel yqkysnnswr 
181 ylsnrllaps dspewlsfdv tgwrqwlsr ggeiegfrls ahcscdsrdn tlqvdinqft 
241 tqrrqdlati. hqmnrpflll matpleraqh Iqssrhrr al dtnycfsste knccvrqlyi 
301 dfrkdlgwkw ihepkgyhan fclgpcpyiw sldtqyskvl alynqhnpga saapccvpqa 
361 leplpivyyv grkpkveqls niaivrsckcs 

[0105] TGFPI Precursor-porcine (NCBI Accession No: P07200) (SEQ ID NO:2) (the 
LAP portion is shown underlined): 

1 mppsglrllp lllpllwllv Itpgrpaagl stcktidmel vkrkrieair ggilsklrla 
61 sppsqqdvpp gplpeavlal ynstrdrvaq esvepepepe adyyakevtr vlmlesgnqi 
121 ydkfkqtphs lymlfntsel reavpepvll sraelrllrl klkveqhvel yqkysndswr 
181 ylsnrllaps dspewlsfdv tgwrqwltr reaieqfrls ahcscdskdn tlhveinqfn 
241 sgrrqdlati hqinnrpflll matpleraqh Ihssrhrr al dtnycfsste knccvrqlyi 
301 dfrkdlgwkw ihepkgyhan fclgpcpyiw sldtqyskvl alynqhnpga saapccvpqa 
361 leplpivyyv grkpJcveqls nmivrsckcs 

[0106] TGFP2 Precursor-Human (NCBI Accession No: P081 12) (SEQ ID NO:3) (the 
LAP portion is shown underlined): 

* 1 mhycvlsafl ilhlvtvals Istcstldmd qfmrkrieai rqqilsklkl tsppedypep 
^1 eevppevisi ynstrdllqe kasrraaace rersdeeyya kevykidmpp ffpsenaipp 
121 tf yrpvfriv rfdvsaxnekn asnlvkaefr vfrlqnpkar vpeqrielyq ilkskdltsp 
181 tqryidskw ktraeqewls fdvtdavhew Ihhkdrnlqf kislhcpcct fvpsnnyiip 
241 nkseelearf agidgtstyt sqdqktikst rkknsqktph lllmllpsyr lesggtnrrk 
301 kraldaaycf mvqdncclr plyidfkrdl gwkwihepkg ynanfcagac pylwssdtqh 
361 srvlslynti npeasaspcc vsqdleplti lyyigktpki eqlsnmivks ckcs 

[0107] TGFP3 Precursor-Human (NCBI Accession No: P10600) (SEQ ID NO:4) (the 
LAP portion is shown underlined): 

1 mkmhlqralv vlallnfatv slslstcttl dfqhikkkrv eairqqilsk Irltsppept 

61 vmthvpyqvl alynstrell eerohqereeq ctqentesey yakeihkfdm iqglaehnel 

121 avcpkqitsk vfrfnvssve knrtnlfrae frvlrvpnps skrneqriel fqilrpdehi 

181 akqryiqqkn Iptrqtaewl sfdvtdtvre wllrresnlq leisihcpch tfqpngdile 

241 nihevmeikf kqvdneddhq rqdlqrlkkq kdhhnphlil mmipphrldn pqqqqqrkkr 

301 aldtnycfm leenccvrpl yidfrqdlgw kwvhepkgyy anfcsgpcpy Irsadtthst 

361 vlglyntlnp easaspccvp qdlepltily yvgrtpkveq Isnmwksck cs 

[0108] By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that the amino 
acid sequence of the subject polypeptide is identical to the query sequmce except that the subject 
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polypeptide sequence can include up to five amino acid alterations per each 100 amino acids of 
the query amino acid sequence. In other words, to obtain a polypeptide having an amino acid 
sequence at least 95% identical to a query amino acid sequence, up to 5% of the amino acid 
residues in the sxibject sequence can be inserted, deleted, or substituted with another amino acid. 
These alterations of the reference sequence can occur at the amino or carboxy terminal positions 
of the reference amino acid sequence or anywhere between those terminal positions, interspersed 
either individually among residues in the reference sequence or in one or more contiguotis 
groups within ttxe reference sequence. 

[0109] As a practical matter, whether any particular polypeptide is at least 80%, 85%, 
90%, 95%, 96%, 97%, 98% or 99% identical to, for example, the amino acid sequence of any 
portion of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3., or SEQ ID NO:4, can be determined 
conventionally using known computer programs. 

[0110] The terms '^percentage of sequence identity" and "percentage homology" are 
used interchangeably herein to refer to comparisons among polynucleotides and polypeptides, 
and are determined by comparing two optimally aUgned sequences over a comparison window, 
wherein the portion of the polynucleotide or polypeptide sequence in the comparison window 
can comprise additions or deletions {i.e., gaps) as compared to the reference sequence (which 
does not comprise additions or deletions) for optimal aUgmnent of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical nucleic 
acid base or amino acid residue occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of positions in the 
window of comparison and multiplyiug the result by 100 to yield the percentage of sequence 
identity. Identity is evaluated using any of the variety of sequence comparison algorithms and 
programs known in the art. Such algorithms and programs include, but are by no means limited 
to, TBIASTN, BLASTP, FASTA, ITASTA, CLUSTALW, FASTDB, the disclosures of which 
are incorporated by reference in their entireties. Pearson et al, 1988, Proc. Natl. Acad. Sci. 
U.S.A. 85:2444-2448; Altschul et al., 1990, J. Mol Biol 215:403410; Thompson et al, 1994, 
Nucleic Acids Res. 22:4673-4680; Higgins a/., 1996, Meth. Enzymol 266:383402; Altschul 
al, 1993, Nature Genetics 3:266-272; and Brutlag et al, 1990, Comp. App. Biosd. 6:237-24. 

[01 1 1] A preferred method for determining the best overall match between a query 
sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program based on 
the algorithm of Brutlag et al, 1990, Comp. App. Biosci. 6:237-245). In a sequence aUgnment 
tiie query and sabject sequences are either both nucleotide sequ^ces or botii amino acid 
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sequences. The result of said global sequence alignment is in percent identity. Preferred 

parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2. Mismatch 

Penalty=l, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=l, Window 

Size=sequeiice length. Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length 

c 

of the subject amino acid sequence, whichever is shorter. 

t [0112] Ifthe subject sequence is shorter than the query sequence due to N- or C- 

terminal deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for N- and C-tenninal 
truncations of the subject sequence when calculating global percent identity. For subject 
sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity 
is corrected by calculating the number of residues of the query sequence that are N- and C- 
terminal of the subject sequence, which are not matched/aligned with a corresponding subject 
residue, as a percent of the total bases of the query sequence. Whether a residue is 
matched/ahgned is determined by results of the FASTDB sequence ahgnment. This percentage 
is then subtracted fix)m the percent identity, calculated by the above FASTDB program using the 
specified parameters^ to arrive at a final percent identity score. This final percent identity score 
is what is used for the purposes of the present invention. Only residues to the N- and C-tennini 
of the subject sequence, which are not matched/ahgned with the query sequence, are considered 
for the purposes of manually adjusting the percent identity score. That is, only query residue 
positions outside the farfliest N- and C-terminal residues of the subject sequence. 

[0113] For example, a 90 amino acid residue subject sequence is aUgned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N-teiminus of 
the subject sequence and therefore, the FASTDB alignment does not show a matching/aUgnment 
of the first 10 residues at the N-terminus, The 10 ui^aired residues represent 10% of die 
sequence (number of residues at the N- and C-temuni not matched/total number of residues in 
the query sequence) so 10% is subtracted firom the percent identity score calculated by the 
FASTDB pro^-am. Ifthe remaining 90 residues were perfectly matched the final percent 
identity would be 90%. In another example, a 90 residue subject sequence is compared with a 
100 residue query sequence. This time the deletions are internal deletions so there are no 
residues at the N- or C-termini of the subject sequence that are not matched/ahgned with the 
query. In this case the percent identity calculated by FASIDB is not manually corrected. Once 
again, only residue positions outside the N- and C-terminal ends of the subject sequence, as 
displayed in the FASTDB alignment, which are not matched/aligned with the query sequence are 
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manually corrected for. No other manual corrections are to be made for the purposes of the 
present invention. 

[01 14] Another preferred method is using the BLAST programs. The BLAST programs 
identify homologous sequences by identifying similar segments, which are referred to herein as 
*Tiigh-scoring segment pairs," between a query amino or nucleic acid sequence and a test 
sequence which is preferably obtained from a protein or nucleic acid sequence database. High- 
scoring segment pairs are preferably identified (/.e., aligned) by means of a scoring matrix, many 
of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 matrix, the 
disclosures of which are incorporated by reference in their entireties). Less preferably, the PAM 
or PAM250 matrices can also be used {see^ e.g., Schwartz, et al, eds., 1978, Matrices For 
Detecting Distance Relationships: Atlas Of Protein Sequence And Structure, Washington: 
National Biomedical Research Foundation, the disclosure of which is incorporated by reference 
in its entirety). The BLAST programs evaluate the statistical significMice of all high-scoring 
segment pairs identified, and preferably select those segments which satisfy a user-specified 
threshold of significance, such as a user-specified percent homology. Preferably, the statistical 
significance of a high-scoring segment pair is evaluated using the statistical significance formula 
of Karlin, the disclosxire of which is incorporated by reference in its entirety. The BLAST 
programs can be used with the default parameters or with modified parameters provided by the 
user. Gonnet et aL, 1992, Science 256:1443-1445; Henikofif a/., 1993, Proteins 17:49-61; 
Karlin a/., 1990. 

C. Pro-TGFP PoLYNucLEoxroES Uses 

Isolation of nucleic acids encoding pro-TGFp polynucleotide family members 
[0115] Pro-TGF|3 polynucleotides, polymorphic variants, orthologs, and alleles that are 
substantially identical to sequences provided herein, as well as other TGF[J supergene family 
members, can be isolated using TGFp nucleic acid probes and ohgonucleotides xmder stringent 
hybridization conditions, by screening Ubraries. Alternatively, expression libraries can be used to 
clone TGFp protein, polymorphic variants, orthologs, and alleles by detecting expressed 
homologs inmiunologically with antisera or purified antibodies made against human TGFP or 
portions thereof. 

[01 16] To make a cDNA library, one should choose a source that is rich in TGFP RNA. 
The mRNA is then made into cDNA using reverse transcriptase, ligated into a recombinant 
vector, and transfected into a recombinant host for propagation, screening and cloning. Methods 
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for making and screening cDNA libraries are well known {see^ e,g.^ Gubler et aL, 1983, Gene 
25:263-269; Sambrooke^ a/., 19S3, supra; Ausnhel et aL, supra). 

[0117] For a genomic library, the DNA is extracted from the tissue and either 
mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb. The 
fragments are then separated by gradient centriftigation from undesired sizes and are constructed 
^ in bacteriophage lambda vectors. These vectors and phage are packaged in vitro. Recombinant 

phage are analyzed by plaque hybridization as described in Benton et aL, 1977, Science 196:180- 
182. Colony hybridization is carried out as generally described in Grunstein et aL, 1975, Proc. 
Natl Acad ScL U.S.A. 72:3961-3965. 

[01181 One exemplary method of isolatuig pro-TGPp nucleic acid and its orthologs, 
alleles, mutants, polymorphic variants, and conservatively modified variants combines the use of 
synthetic oligonucleotide primers and ampUfication of an RNA or DNA template (see U.S. Pat. 
Nos. 4,683,195 and 4,683,202; Innis et al, eds., 1990, PCR Protocols: A Guide to Methods and 
Applications. Methods such as polymerase chain reaction (PCR) and ligase chain reaction (LCR) 
can be used to amplify nucleic acid sequences of human pro-TGFp directly firom mRNA, from 
cDNA, from genomic libraries or cDNA Ubraries. Degenerate oligonucleotides can be designed 
to amplify TGFp homologs using the sequences provided herein. Restriction endonuclease sites 
can be incorporated into the primers. Polymerase chain reaction or other in vitro amplification 
methods can also be useful, for example, to clone nucleic acid sequences that code for proteins to 
be expressed, to make nucleic acids to use as probes for detecting the presence of TGFp 
encoding mRNA in physiological samples, for nucleic acid sequencing, or for other purposes. 
Genes amplified by the PCR reaction can be purified from agarose gels and cloned into an 
appropriate vector. 

[0119] Gene expression of TGFp can also be analyzed by techniques Icnown in the art, 
e.g., reverse transcription and amphfication of mRNA, isolation of total RNA or poly A^ RNA, 
northem blotting, dot blotting, in situ hybridization, RNase protection, high density 
polynucleotide array technology, e.g., and the like. 

[0120] Nucleic acids encoding TGFp or pro-TGFp proteia can be used with high 
density oligonucleotide array technology (e.g. , GeneChipTJ^ to identify TGFp and pro-TGFp 
protein, orthologs, alleles, conservatively modified variants, and polymorphic variants in this 
I' invention. In the case where the homologs being identified are linked to TGFp related diseases, 

they can be used with GeneChip™ as a diagnostic tool in detecting the disease in a biological 
sample, see, e.g., Gunthand et al, 1998, AIDS Res. Hum. Retroviruses 14:869-876; Kozal et aL, 
1996, Nat Med. 2:753-759; Matson et al, 1995, Anal. Biochem. 224:1 10-106; Locldiart et al, 
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1996, Nat Biotechnol: 14:1675-1680; Gingeras a/., 1998, Genome iiey. 8:435-448; Hacia, 
a/., l99Sy Nucleic Acids Res. 26:3865-3866. 



Expression of pro-TGFp polynucleotide family members in prokaryotes and 
eukaryotes 

[0121] To obtain high level expression of a cloned gene, such as those cDNAs 
encoding EDG, one typically subclones EDG into an expression vector that contains a strong 
promoter to direct transcription, a transcription/translation temainator, and if for a nucleic acid 
encoding a protein, a ribosome binding site for translational initiation. Suitable bacterial 
promoters are well known in the art and described, e,g., in Sanibrook et aL and Ausubel et al, 
supra. Bacterial expression systems for expressing the EDG protein are available in, e,g., E, 
colt. Bacillus sp., and Salmonella (Palva et aL, 1983, Gene 22:229-235; Mosbach et aL, 1983, 
Nature 302:543-545. Kits for such expression systems are commercially available. Eukaryotic 
expression systems for mammalian cells, yeast, and insect cells are well known in the art and are 
also commercially available. In one preferred embodiment, retroviral expression systems are 
used in the present invention. 

[0122] Selection of the promoter used to direct expression of a heterologous nucleic 
acid depends on the particular application. The promoter is preferably positioned about the same 
distance from the heterologous transcription start site as it is from the transcription start site in its 
natural setting. As is known in the art, however, some variation in this distance can be 
accommodated without loss of promoter function. 

[0123] hi addition to the promoter, the expression vector typically contains a 
transcription unit or expression cassette that contains all the additional elements required for the 
expression of the EDG encoding nucleic acid in host cells. A typical expression cassette thus 
contains a promoter operably linked to the nucleic acid sequence encoding EDG and signals 
required for efficient polyadenylation of the transcript, ribosome binding sites, and translation 
termination. Additional elements of the cassette may include enhancers and, if genomic DNA is 
used as the stractural gene, introns with ftmctional spHce donor and acceptor sites. 

[0124] In addition to a promoter sequence, the expression cassette should also contain a 
transcription termination region downstream of the structural gene to provide for efficient 
termination. The termination region may be obtained from the same gene as the promoter 
sequence or may be obtained from different genes. 

[0125] The particular expression vector used to transport the genetic information into 
the cell is not particularly critical. Any of the conventional vectors used for expression in 
eukaryotic or prokaryotic ceUs may be used. Standard bacterial expression vectors include 
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plasmids such as pBR322 based plasmids, pSKF, pET23D, and fusion expression systems such 

as MBP, GST, and LacZ, Epitope tags can also be added to recombinant proteins to provide 
convenient methods of isolation, e.g., c-myc. Sequence tags may be included in an expression 
cassette for nucleic acid rescue. Markers such as fluorescent proteins, green or red fluorescent 
protein, p-gal, CAT, and the like can be included in the vectors as markers for vector 
transduction. 

[0126] Expression vectors containing regulatory elements from eukaryotic viruses are 
typically used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma vims vectors, 
retroviral vectors, and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic 
vectors include pMSG, pAVOOg/A"", pMTOlO/A^, pMAMneo-5, baculovirus pDSVE, and any 
other vector allowing expression of proteins under the direction of the CMV promoter, SV40 
early promoter, S V40 later promoter, metallothionein promoter, murine mammary tumor virus 
promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters shown 
effective for expression in eukaryotic cells. 

[0127] Expression of proteins from eukaryotic vectors can also be regulated using 
inducible promoters. With inducible promoters, expression levels are tied to the concentration of 
inducing agents, such as tetracychne or ecdysone, by the incorporation of response elements for 
these agents into the promoter. Generally, high level expression is obtained from inducible 
promoters only in the presence of the inducing agent; basal expression levels are minimal 

[0128J In one embodiment, the vectors of the invention have a regulatable promoter, 
e.g,j tet-regulated systems and the RU-486 system (see, e.g., Gossen and Bujard, 1992, Proc. 
Nat'lAcad. Set USA 89:5547; Oligino et aL, 1998, Gene Titer. 5:491-496; Wang et al, 1997, 
Gene Titer. 4:432-441; Neering a/., 1996, Blood 88:1147-1155; and Rendahl a/., 1998, iVat 
Biotechnol. 16:757-761. These impart small molecule control on the expression of the candidate 
target nucleic acids. This beneficial feature can be used to determine that a desired phenotype is 
caused by a transfected cDNA rather than a somatic mutation. 

[0129] Some expression systems have markers that provide gene amplification such as 
thymidine kinase and dihydrofolate reductase. Alternatively, higji yield expression systems not 
involving gene amplification are also suitable, such as using a bacxilovirus vector in insect cells, 
with a EDG encoding sequence under the direction of the polyhedrin promoter or other strong 
baculovirus promoters. 

[0130] The elements that are typically included in expression vectors also include a 
replicon that fimctions in E. coli, a gene encoding antibiotic resistance to permit selection of 
bacteria that harbor recombinant plasmids, and unique restriction sites in nonessential regions of 
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the plasmid to aUow insertion of eukaryotic sequences. The particular antibiotic resistance gene 
chosen is not critical, any of the many resistance genes known in the art are suitable. The 
prokaryotic sequences are preferably chosen such that they do not interfere with the repUcation 
of the DNA in eukaryotic cells, if necessary. 

[0131] The use of pro-TGFp polynucleotides of the invention to achieve the higli 
efficiency expression of mammalian pro-TGFp polypeptides in eukaryotic ceUs is contemplated. 
Preferred host cells for expression of the TGFP constructs of the invention are derived ftom 
multicellular organisms. Examples of invertebrate cells include insect ceUs such as Drosophila 
S2 and Spodoptera Sf9 or Spodoptera High 5 cells, as well as plant ceUs. Examples of useful 
mammahaa host cell lines include Chinese hamster ovary (CHO), NSO, NSl and COS cells. 
More specific examples include the following monkey kidney CVl line transfonned by SV40 
(COS-7, ATCC CRL 1651); human embryonic kidney Une 293 or 293 cells subcloned for 
groAvth in suspension culture (Graham et al., 1977, J. Gen Virol. 36:59); Chinese hamster ovary 
cells/-DHFR (Urlaub and Chasin, 1980, Proc. Natl. Acad. Sci. USA 77:4216); mouse Sertoli cells 
(TM4) (Mather, 1980, Biol. Reprod. 23:243-251); human lung cells (W138, ATCC CCL 75); 
hounan Uver cells (Hep G2, HB 8065); and mouse mammary tumor cells (MMT 060562, ATCC 
CCL5 1). In a preferred embodiment of the invention, the use of CHO cells is contemplated. 

[0132] Other eukaryotic organisms that can be employed to achieve expression of the 
TGFp polynucleotides of the mvention are eukaryotic microbes such as filamentous fimgi or 
yeast. Saccharomyces cerevisiae is a commonly used lower eukaryotic host microorganism. 
Others mclude Schizosaccharomyces pombe (Beach and Nurse, 1981, Nature 290:140; EP 
139,383 published Can 2, 1985); Kluyveromyces hosts (U.S. Pat No. 4,943,529; Fleer et al, 
1991, Bio/Technology 9:968-975 such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; 
Louvencourt et al, 1983, J. Bacteriol 154(2):737-742; K.Jragilis (ATCC 12,424), K. bulgaricus 
(ATCC 16,045), K. wickeramii (ATCC 24,178), K. waitii (ATCC 56,500), K. drosophilanim 
(ATCC 36,906; Van den Berg et al, 1990, Bio/Technology 8:135), K. thermotolerans, and K. 
marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; Sreekrishna et al, 1988, J. 
Basic Microbiol, 28:265-278); Candida; THchoderma reesia (EP 244,234); Neurospora crassa 
(Case et al, 1979, Proc. Natl Acad Sci. USA 76:5259-5263); Schwanniomyces such as 
Schwanniomyces occidentalis (EP 394,538 pubUshed Oct. 31, 1990); and filamentous fungi such 
as, e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 1991), and 
Aspergillus hosts such as A. nididans (BaUance et al, 1983, Biochem. Biophys. Res. Commun. 
112:284-289; Tilbum et al, 1983, Gene 26:205-221; Yelton et al., 1984, Proc. Natl Acad. Sci. 
USA 81:1470-1474; and A m^er (Kelly and Hynes, l985,EMBOJ., 4:475479). Methylotropic 
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yeasts are suitable herein and include, but are not limited to, yeast capable of growth on 

methanol selected from the genera consisting oiHoitsenula, Candida, Kloeckera, Pichia, 

Saccharomyces, Torulopsis, and Rhodotonda. A list of specific species that are exemplary of 

this class of yeasts can be found in Anthony, 1982, Tlie Biochemistjy of Methylotrophs, 269. 

[0133] The selection of an appropriate host cell can be made by one of skill in the art. 
Host cells are transfected or transformed with nucleic acid molecules of the invention and 
cultured in conventional nutrient media that is modified as appropriate for inducing promoters, 
selecting traasfonnants, or amplifying flie genes encoding the desired sequences. The culture 
conditions, such as media, temperature, pH and the like, can be selected by the skilled artisan 
without undue experimentation. In general, principles, protocols, and practical techniques for 
maximizing the productivity of mammalian cell cultures can be found in M. Butler, ed, 1991, 
Mammalian Cell Biotechnology: a Practical Approach, IRL Press. 

[0134] Methods of eukaryotic cell transfection or transformation are known to the 
ordinarily skilled artisan. For eukaryotic cells without cell walls, the calcium phosphate 
precipitation method of Graham and van der Eb, 1978, Virology 52:456457 can be employed. 
Other techniques that can be employed include the use of liposomes, electroporation, 
microinjection, cell fiision, and DEAE-dextran. Various techniques for transforming mammalian 
cells are described in Keown et al, 1990, Methods in Enzymology 185:527-537) and Mansour et 
al, 1988, Nature 336:348-352. General aspects of mammalian cell host system transfections 
have b^en described in U.S. Pat. No. 4,399,216. Infection wiiti Agrobacterium tumefaciens can 
be used for the transformation of certain plant cells, as described by Shaw et al, 1983, Gene 
23:315) and m WO 89/05859 (published June 29, 1989). Transformations into yeast are 
typically carried out according to the method of Van Solingen et al, 1977, J. Bact 130:946); and 
Hsiao et al, 1979, Proc. Natl Acad, ScL (USA) 76:3829. 

[0135] The TGPP encoding nucleic acid molecules of the invention will typically be 
inserted into a replicable vector for cloning (amplijBcation of the DNA) or for expression. 
Various vectors are pubUcly available. The vector can, for example, be in the form of a plasmid, 
cosmid, viral particle, or phage. The appropriate nucleic acid sequence can be inserted into the 
vector by a variety of procedures. In general, DNA is inserted into an appropriate restriction 
endonuclease site(s) using techniques known in the art. Vector components can include, but are 
not limited to, one or more of an origin of replication, marker gene(s), enhancer elemerit(s), 
promoter(s), and transcription termination sequence(s) . Construction of suitable vectors 
containing one or more of these con:q)onents employs standard techniques that are known to the 
skilled artisan. 
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[0136] Both expression and cloning vectors contain a nucleic acid sequence that 
enables the vector to replicate in one or more selected host cells. Such sequences are well known 
for a variety of bacteria, yeast, and viruses. The origin of repUcation from the plasmid pBR322 
is suitable for most Gram-negative bacteria, the 2^ plasmid origin is suitable for yeast, and 
various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in 
mammalian ceUs. Expression and cloning vectors will typically contain a selection gene, also 
termed a selectable marker. 

[01 37] Suitable selectable markers for mammalian cells enable the identification of 
transfected cells. Examples of suitable marker systems include the dihydrofolate reductase 
(DHFR)/methotrexate system and the glutamine synthetase/methionine sulfoxamine system that 
are well known in the art. These systems are particularly ^plicable for use with Chinese 
hamster ovary cells (Goeddel, 1990, Methods in Enzymology 185:543-551). Selectable markers 
can also be used to increase the productivity of a recombinant cell hne by gene amplification of 
the inserted gene. Thymidme kinase is another selectable marker that can be used in mammalian 
cells. A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid 
YRp7 (Stmchcomb et al., 1979, Nature 282:39; Kingsman et al, 1979, Gene 7: 141 ; Tschemper 
et al, 1980, Gene 10:157. The trpl gene provides a selection marker for a mutant strain of yeast 
lacking the ability to grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 (Jones, 
1977, Genetics 85:12). 

[0138] Expression and cloning vectors usually contain a promoter operably linked to 
the TGPp-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a 
variety of potential host cells are well known. Promoter useful in mammalian cells include, for 
example, the following: (i) promoters obtained from the genomes of viruses such as polyoma 
vmis, fowlpox virus (UK 2,21 1,504 pubUshed Jul. 5, 1989), adenovmis (such as Adenovirus 2), 
bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and 
Simian Virus 40 (SV40); (ii) heterologous mammahan promoters including, for example, the 
actin promoter, an immunoglobulin promoter, and promoters derived from heat-shock genes. 

[0139] Examples of suitable promoting sequences for use with yeast hosts include the 
promoters for 3-phosphoglycerate kinase (Hitzeman et al. , 1980, J. Biol. Chem. 255:2073) or 
other glycolytic enzymes (Hess et al, 1968, J. Adv. Enzyme Reg. 7:149; Holland, 1978, 
Biochemistry 17:4900), such as enolase, glyceraldehyde-3 -phosphate dehydrogenase, 
hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase, and glucokinase. Other yeast promoters, which are inducible promoters having the 
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additional advantage of transcription controlled by growth conditions, are the promoter regions 
for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes 
associated with nitrogen metabolism, metallothionein, glyceraldehyde-3 -phosphate 
dehydrogenase, and enzymes responsible for maltose and galactose utiUzation, Suitable vectors 
and promoters for use in yeast expression are further described ra EP 73,657. 

[0140] Transcription of a TGFp-encoding nucleic acid by higher eukaryotes can be 
increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements 
of DNA, usually about from 10 to 300 bp, which act on a promoter to increase its transcription. 
Many enhancer sequences are now known from mammalian genes, e.g., globin, elastase, 
albumin, alpha-fetoprotein, and insulin. Typically, however, an enhancer from a eukaryotic cell 
virus will be employed. Examples include the SV40 enhancer on the late side of the ireplication 
origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
late side of the replication origin, and adenovirus enhancers. The enhancer can be spliced into 
the vector at a position 5' or 3' to the TGFp coding sequence, but is preferably located at a site 
5' from the promoter. 

[0141] Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
animal, human, or nucleated cells from other multicellular organisms) will also contain 
sequences necessary for the termination of transcription and for stabilizing the mRNA. Such 
sequences include, for example, nucleotide segments transcribed as polyadenylated fragments in 
the untranslated portion of the mKNA encoding a TGF|3 polypeptide. Such sequences can be 
readily obtained from eukaryotic or viral genes. 

[0142] Nucleic acid molecules of the invention can contain a polynucleotide that 
encodes a natural signal polypeptide or a heterologous signal polypeptide to facilitate secretion 
of TGFp polypeptides into the culture medium. A "heterologous signal polypeptide" refers to 
any polypeptide sequence that facilitates secretion of TGFp polypeptides into the culture 
medium, other than the signal polypeptide that is associated with a particular TGFP in nature. In 
a preferred embodiment, nucleic acid molecules of the invention contain a polynucleotide that 
encodes a heterologous signal polypeptide to facilitate secretion of TGFp polyp^tides into the 
culture medium. The choice of signal peptide or can depend on factors such as the type of host 
cells in which the recombinant polypeptide is to be produced. To illustrate, examples of 
heterologous signal peptides that are frmctional in mammalian host cells include the following: 
the signal sequence for interleukin-7 (IL-7) described in U.S. Pat. No. 4,965,195; the signal 
sequence for interle\ikin-2 receptor described in Cosman et al., 1984, Nature 312:768; the 
interleukin-4 receptor signal peptide described in EP 367,566; the type I interleukin«l receptor 
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signal peptide described in U.S. Pat. No. 4,968,607; and the type TL interleuldn-l receptor signal 
peptide described in EP 460,846. A leader sequence that has been shown to be effective in insect 
cells is the lobster tropomyosin leader sequence (Sano et al., 2002, JFEBS Lett. 532:143-146). 
For yeast secretion the signal sequence can be, e.g., the yeast invertase leader, alpha factor leader 
(including Saccharomyces and Kluyveromyces alpha-factor leaders, the latter described m U.S. 
Pat. No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 
362,179 pubUshed Apr. 4, 1990), or the signal described in WO 90/13646 published Nov. 15, 
1990. In mammalian cell expression, manmiahan signal sequences can be used to direct 
secretion of the protein, such as signal sequences from secreted polypeptides of the same or 
related species, as well as viral secretory leaders. 

[0143] Preferred embodiments of the invention contain a polynucleotide that encodes a 
serum albumin signal polypeptide, and more preferably the rat serum albumin signal polypeptide 
having the following sequence (SEQ ID NO:5): mkwvtfllll fisgsfs 

Purification of pro-TGFp polypeptide family members 

[0144] A number of procedures can be employed when recombinant pro-TGFp 
polypeptide is being purified. For example, proteins having establislied molecular adhesion 
properties can be reversible fused to the TGFp protein. With the appropriate Ugand, the pro- 
TGFP protein can be selectively adsorbed to a purification column and then freed from the 
column in a relatively pure form. The fiised protein is then removed by enzymatic activity. 
Finally, pro-TGFp protein could be purified using immunoafBnity columns. 

[0145] The TGPP proteins can also be separated from other proteins on the basis of its 
size, net surfece charge, hydrophobicity, and afSnity for ligands. In addition, antibodies raised 
against proteins can be conjugated to column matrices and the proteins immunopurified. AU of 
these methods are well known in the art It will be apparent to one of skill that chromatogr^hic 
techniques can be performed at any scale and using equipment from many different 
manufacturers (e.g., Pharmacia Biotech). 

[0146] Recombinant proteins are expressed by transfonned bacteria in large amounts, 
typically after promoter induction; but expression can be constitutive. Promoter induction with 
IPTG is one example of an mducible promoter system. Bacteria are grown according to standard 
procedures in the art. Fresh or frozen bacteria cells are used for isolation of protein. 

[0147] Proteins expressed in bacteria may form insoluble aggregates ("inclusion 
bodies"). Several protocols are suitable for purification of TGFp protein inclusion bodies. For 
example, purification of inclusion bodies typicaUy involves the extraction, separation and/or 
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purification of inclusion bodies by disn:q)tion of bacterial cells, e.g., by incubation in a buffer of 
50 mM TRIS/HCL pH 7.5, 50 mM NaCl, 5 mM MgCIi, 1 mM DTT, 0.1 mM ATP, and 1 mM 
PMSF. The cell suspension can be lysed using 2-3 passages through a French Press, 
homogenized using a Polytron (Brinkman Instruments) or sonicated on ice. Alternate methods 
of lysing bacteria are apparent to those of skill in the art {see. e.g., Sambrook et aL, supra; 
Ausubel et al, supra). 

[0148] If necessary, the inclusion bodies are solubilized, and the lysed cell suspension 
is typically centrifuged to remove unwanted insoluble matter. Proteins that formed the inclusion 
bodies may be rraatured by dilution or dialysis wifli a compatible buffer. Suitable solvents 
include, but are not limited to urea (from about 4 M to about 8 M), foimamide (at least about 
80%, volume/volume basis), and guanidine hydrochloride (firom about 4 M to about 8 M). Some 
solvents which are capable of solubiUzing aggregate-forming proteins, for example SDS (sodium 
dodecyl sulfate), 70% formic acid, are inappropriate for use in this procedure due to the 
possibility of irreversible denaturation of the proteins, accompanied by a lack of iamiunogenicity 
and/or activity. Although guanidine hydrochloride and similar agents are denaturants, this 
denaturation is not irreversible and renaturation may occur upon removal (by dialysis, for 
example) or dilution of the denaturant, allowing re-fonnatioii of inmiunologically and/or 
biologically active protein. Other suitable buffers are known to those skilled in the art. Human 
TGFp proteins are separated from other bacterial proteins by standard separation techniques, e.g., 
wifli Ni-NTA agarose resin. 

[0149] Alternatively, it is possible to purify TGPP protein from bacteria periplasm. 
After lysis of the bactraia, when the TGFp protein exported into the periplasm of the bacteria, 
the periplasmic fraction of the bacteria can be isolated by cold osmotic shock in addition to other 
methods known to skill in the art. To isolate recombinant proteins from the periplasm, the 
bacterial cells are centrifuged to form a pellet The pellet is resuspended in a buffer containing 
20% sucrose. To lyse the cells, the bacteria are centrifiiged and the pellet is resuspended in ice- 
cold 5 mM MgS04 and kept in an ice bath for approximately 10 minutes. The cell suspension is 
centriftiged and the siqjematant decanted and saved. The recombinant proteins present in the 
supernatant can be separated from the host proteins by standard separation techniques well 
known to those of skiU in the art. 

[01501 hi a preferred embodiment of the invention, polynucleotides of the invention 
encode a pro-TGFp polypeptide that is ftised to a purification tag polypeptide. As used herein, 
"purification tag polypeptide" refers any polypeptide sequence tiiat facilitates purification of the 
mature TGFp polypeptide wifii an affinity purification system. The nature of the purification tag 
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polypeptide will depend on the particular afiBnity pinification system used. Various systems are 
available. In one embodiment, the afiBnity chromatographic system is immobilized metal affinity 
chromatography (IMAC), which is based on binding of a purification tag polypeptide to a metal 
ion resin. Metal ions can be, e.g,, zinc, nickel, or cobalt ions. The tag can be a polyhistidine 
sequence, which interacts specifically with metal ions such as nickel, cobalt, iron, or zinc. A 
polyhistidine tag can be 2xHis; 3xHis; 4xHis; Sxffis; 6xHis; 7xHis; SxHis or other, provided that 
it binds essentially specifically to a metal ion. The tag can also be a polylysine or polyarginine 
sequence, comprising at least four lysine or four arginine residues, respectively, wtiich interact 
specifically with zinc, copper or a zinc finger protein. 

[0151] Generally, using IMAC, affinity tagged recombinant proteins are expressed and 
purified (>90% pure or more) using a one-step purification procedure firom cmde l3^ates or cell 
culture supematants. Briefly, cells are lysed under native or denaturing conditions (membrane 
proteins, inclusion bodies, and the like) applied to 96 well plates, and incubated with metal- 
chelate affinity resin. The resin is washed to remove nonspecifically bound contaminants, and 
the protein of interest is eluted using either increased imidazole concentrations or low pH. 

[0152] Commercially available systems for IMAC are Icnown to those of skill in the art 
include systems that are available firom Qiagen, Clontech, hiVitrogen and Novagen. 
Polyhistidine tagged proteins can be purified on nickel affinity chromatography as described in 
Example 2 below and as described as follows. Ni-agarose beads are equiUbrated by washing 
twice with a 5 times volume of binding buffer, e,g., 50 mM Hepes pH 7.5, 500 mlVI NaCl, 5% 
glycerol. The binding buffer can also be 50 mM Tris, pH 7.5, 150 mM NaCl, 2.5 mM MgCh. 
The binding buffer can also be combinations of the two buffers described, or have yet different 
ingredients, which a person of skill in the art can readily determine. The supemataoat of the 
harvested medium is added to the equiUbrated Ni-agarose beads. The non-specifically bound 
proteins can be removed witii a wash buffer, which can be the same as the bindmg buffer. 
Additionally, the salt concentration can be increased, e,g,^ firom 150 mM NaCl in the binding 
buffer to 300 mM NaCl in the wash buffer. Bound protem is eluted with elution buffer, which 
can be identical to the binding buffer witii the addition of about 0 M to about 1 M imidazole, 
preferably about 0 mM to about 300 mM imidazole and most preferably about 200 mM 
imidazole. If desired, protein concentrations can be estimated by using the Bio-Rad® protein 
assay and protein purity can be assessed by SDS-PAGE and Coomassie blue staining. The 
protein samples can be flash-frozen and stored at -80^C. 

[0153] Other purification tag polypeptides can comprise an epitope to which an anti-tag 
antibody can selectively bind The presence of such epitope-tagged forms of the pro-TGFp 
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polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of the 
epitope tag enables the pro-TGFp polypeptide to be readily purified by afiBnity purification using 
an anti-tag antibody or another type of affinity matrix that binds to the epitope tag. Various 
epitope tag polypeptides and their respective antibodies are well known in the art. Examples 
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag 
polypeptide and its antibody 12CA5 (Field et al, 1988, Mol. Cell. Biol 8:2159-2165); tiie c-myc 
tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto (Evan etal, 1985, Molecular 
and Cellular Biology, 5:3610-3616); and the Herpes Simplex vims glycoprotein D (gD) tag and 
its antibody (Paborsky et al, 1990, Protein Engineering 3(6):547-553). Other tag polypeptides 
include the Flag-peptide (Hopp et al, 1988, Bio Technology 6:1204-1210); theKT3 qpitope 
peptide (Martin et al, 1992 Science, 255:192-194); an alpha-tiibulin epitope peptide (Skinner et 
al, 1991, J. Biol Chem. 266:15163-15166); and the T7 gene 10 protein peptide tag (Lutz- 
Freyermuthe/a/., 1990, Proc. Natl Acad. Sci. USA 87:6393-6397). 

[0154] The TGFP encoding nucleic acid molecules of the invention can be used to 
produce high quantities of mature TGFp in culture. Mature TGFp polypeptides are secreted 
from cells as biologically inactive homodimers. The pro-TGFP is cleaved into Ibe C-terminal 
mature portion and the N-tenninal LAP portion, and the mature TGF remains non-covalently 
bound with the N-tenninal LAP portion after secretion. Mature TGFp is activated by 
dissociation from the LAP polypeptide, which can be accomplished in vitro by heat, acid or 
alkaline treatment, deglycosylation or proteolysis by plasmin (Lawrence et al, 1985, Biochem. 
Biophys. Res. Comfnun. 133:1026-1034; Lyons, 1988,^ Cell Biol 106:1659-1665). 

[0155] When transfected into appropriate host cells and cultured under appropriate 
conditions, the polynucleotides of the invention can be employed to produce at least about 15, 
18, 20, 22, 24, 26, 28, or 30 mgs of mature TGFp polypeptide per titer of spent culture medium. 
As described above, when secreted into the culture medimn, mature TGFP is tjpically non 
covaientiy associated with the LAP polypeptide. As used herein, the term "TGFp complex" 
refers to this non-covalentiy bound conq>lex of mature TGFP and LAP polypeptide. Subsequent 
processing steps can be used to purify the mature TGFp from the culture medium and/or separate 
flie mature TGFp fix)m the LAP polypeptide. It is contemplated as part of the invention that 
these steps can be accomplished in either order. That is, tiie TGFp complex can be purified from 
the culture medium and then resolved into its separate components, or the TGFP complex can be 
dissociated in the culture medium and with the subsequent purification of the mature TGFp, 
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[0156] Standard protein separation techniques for purifying TGFp proteins are also 
contanplated. Often as an initial step, particularly if the protein mixture is complex, an initial 
salt fractionation can separate many of the unwanted host cell proteins (or proteins derived from 
the cell culture media) from the recombinant protein of interest. The preferred salt is ajnxnonium 
sulfate. Ammonium sulfate precipitates proteins by effectively reducing the amount of water in 
the protein mixture. Proteins tlien precipitate on the basis of their solubility. The more 
hydrophobic a protein is, the more likely it is to precipitate at lower ammonium sulfate 
concentrations. A typical protocol includes adding saturated ammonium sulfate to a protein 
solution so that the resultant ammonium sulfate concentration is between 20-30%. This 
concentration will precipitate the most hydrophobic of proteins. The precipitate is then discarded 
(unless the protein of interest is hydrophobic) and ammonium sulfate is added to the supernatant 
to a concentration known to precipitate the protein of interest. The precipitate is then solubilized 
in buffer and the excess salt removed if necessary, either through dialysis or diafiltration. Other 
methods that rely on solubility of proteins, such as cold ethanol precipitation, are well known to 
those of skill in the art and can be used to fractionate complex protein mixtures. 

10157] The molecular weight of the TGFp proteins can be used to isolate it from 
proteins of greater and lesser size using ultrafiltration through membranes of different pore size 
(for example, Amicon or Millipore membranes). As a first step, the protein mixture is 
ulfrafiltered through a membrane with a pore size that has a lower molecular wei^t cut-off than 
the molecular weight of the protein of interest. The retentate of the ultrafiltration is then 
ultrafiltered against a membrane with a molecular cut off greater than the molecular weight of 
the protein of interest. The recombinant protein will pass through the membrane into the filtrate. 
The filtrate can then be chromatogr^hed as desocibed below. 

[0158] The TGpp proteins can also be separated from other proteins on the basis of its 
size, net surfece charge, hydrophobicity, and afBnity for ligands. In addition, antibodies raised 
against proteins can be conjugated to column matrices and the proteins immimopurified. All of 
these methods are well known in the art. It will be apparent to one of skill that chromatographic 
techniques can be performed at any scale and using equipment from many different 
manufacturers (e.g., Pharmacia Biotech). 

[01 59] A preferred method to purify the mature TGFp complex from the culture 
medium employs the use of a binding agent q)ecific for the purification tag polyp^tide as 
described. The purification tag polypeptide is preferably located on the N-tenninal portion of the 
LAP polypeptide. The non-covalent association of the mature TGFp polypeptide and the LAP 
polypeptide allows for the purification of both the mature TGFP polypeptide and the LAP 
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polypeptide using a binding agent for the isolation tag sequence. Once the mature TGFp 

complex is purified from the culture medium, mature TGFp can be dissociated from LAP using 
any of the activation methods described above and known in the art {e.g,, heat, acid or alkaline 
treatment, deglycosylation or proteolysis by plasmin). "Activating the TGFP complex'* as used 
herein refers to any method that results in the dissociation of mature TGFp from the LAP 
polypeptide. The separation of mature TGFp from LAP can be accomplished using any the 
techniques that are well known in the art to accomplish such separations. A preferred method for 
the separation is the use of size exclusion chromatography. 

[0160] The process of the invention including the expression of the polynucleotides of 
the invention in recombinant eukaryotic cells and subsequent purification steps will typically 
produce mature TGFp in an amount of at least 15, 18, 20, 22, 24, 26, 28 or 30 mg of mature 
TGFp per liter of spent culture medium in a composition of matter that is at least about 93, 95, 
97, or 98 percent pure. 

D. Pro-TGFP Amino Acros, Natural Metabolftes, Derivatives, Designed 
Analogs, and Other Organic Molecules 

[0161] The TGFp also include organic molecules having TGFp activity, as defined and 
described herein. While polypeptides and proteins are often described as exemplary, it should be 
understood that TGFp molecules of the present invention are not limited to those having either 
conventional amino acid side chains or a polyamide backbone structure. 

[01 62] As noted previously, the present invention contemplates a variety of TGFp 
molecules, including proteins, polypeptides, and molecules including amino acid residues, as 
well as a variety of TGFp compositions. While one tends tothinlc of the "common" natural 
amino acids {i.e., those listed m Table 1; see Section A above) as being preferred for use in 
biological compositions, it is also true that a wide variety of other molecules, including 
imcommon but naturally occurring amino acids, metabolites and cataboUtes of natural amino 
acids, substituted amino acids, and amino acid analogs, as well as amino acids in the *T)" 
configuration, are usefiil in molecules and compositions of the present invention. In addition, 
"designed" amino acid derivatives, analogs and mimics are also usefiil in various compounds, 
compositions and methods of the present invention, as well as polymers including baclcbone 
structures composed of non-amide linkages. 

[0163] For example, in addition to fihie ]>amino acids listed in Section A above, amino 
acid metaboUtes such as homoarginine, citruUine, ornithine, and a-aminobutanoic acid are also 
usefiil in molecules and compositions of the present invention. 
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[0164] Further, substituted amino acids which are not generally derived from proteins, 
but which are known in nature, are useful as disclosed herein, include the following examples: L- 
canavanine; 1-methyl-L-histidine; 3-methyl-L-histidine; 2-methyl L-histidine; a,e- 
diaminopimeUc acid (L form, meso form, or both); sarcosine; L-omithine betaine; betaine of 
histidine (herzynine); L-citruUine; L-phosphoarginine; D-octopine; o-carbamyl-D-serine; y- 
aminobutanoic acid; and P-lysine. D-amino acids and D-amino acid analogs, including the 
following, are also useful in proteins, peptides and compositions of the present invention: D- 
alanine, D-serine, D-valine, D-leucine, D-isoleucine, D-aUoisoleucine, D-phenylalanine, D- 
glutamic acid, D-proline, and D-allohydroxyproline, and the like. 

101 65] The present invention also discloses that an extensive variety of amino acids, 
including metaboUtes and cataboUtes thereof, can be incorporated into molecules which display a 
TGFp activity. For example, molecules such as ornithine, homoargjnine, citrulUne, and a- 
aminobutanoic acid are useful components of molecules displaying TGFP activity as described 
herein. 

[0166] It should also be appreciated that the present invention encompasses a wide 
variety of modified amino acids, including analogs, metaboUtes, cataboUtes, and derivatives, 
irrespective of the time or location at which modification occurs. In essence, one can place 
modified amino acids into three categories: (1) cataboUtes and metabolites of amino acids; (2) 
modified amino acids generated via posttranslational modification ie.g., modification of side 
chains); and (3) modifications made to amino acids via non-metaboUc or non-cataboUc processes 
(e.g., the synthesis of modified amino acids or derivatives in the laboratory). 

[0167] The present invention also contemplates that one can readily design side chains 
of the amino acids of residue units that include longer or shortened side chains by adding or 
subtracting methylene groups in either linear, branched chain, or hydrocarbon or heterocycUc 
ring arrangements. The linear and branched chain structures can also contain non-carbon atoms 
such as S, O, or N. Fatty acids can also be useful constituents of TGFp molecules herein. The 
designed side chains can terminate with (R') or without (R) charged or polar group appendages. 

[01 68] In addition, analogs, including molecules resulting from the use of different 
linkers, are also useful as disclosed herein. Molecules with side chains Unked together via 
Unkages other than the amide linkage, e.g., molecules containing amino acid side chains or other 
side chains (R- or R'-) wherein the components are Unked via carboxy- or phospho-esters, 
ethylene, methylene, ketone or ether Unkages, to name a few examples, are also useful as 
disclosed herein. In essence, any amino acid side chain, R or R' group-containing molecule can 
be useful as disclosed herein, as long as the molecule includes alternating hydrophiUc and 
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hydrophobic residues (i.e., component molecules) and displays TGFp activity as described 
herein. 

[0169] The present invention also contemplates molecules comprising peptide dimers 
joined by an appropriate linker, e.g,, peptide dimers Unked by cysteine molecules. (As those of 
skill in the art are aware, two cysteine molecules can be linlced together by a disulfide bridge 
formed by oxidation of their thiol groups.). Such linkers or bridges can thus cross-link different 
polypeptide chains, dimers, trimers, and the like. Other useful linkers which can be used to 
connect peptide dimers and/or other peptide multimers include those listed above, e.g.^ carboxy- 
or phospho-ester, ethylene, methylene, ketone or ether linkages, and the like. 

[0170] While it is appreciated that many useful polypeptides disclosed herein^ e.g., the 
TGFP polypeptides (e.g^., SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4), 
comprise naturally-occurring amino acids in the "L" form which are joined via peptide linkages, 
it should also be miderstood that molecules including amino acid side chain analogs, non-amide 
linkages (e.g., differing backbones) can also display a significant TGFp activity and can possess 
other advantages, as well. For example, if it is desirable to construct a molecule ie.g., for use in a 
TGFP composition) that is not readily degraded, one can wish to syntliesize a polypeptide 
molecule comprising a series of D-amino acids. 

[0171] **Polypeptoids" are a class of non-natural, sequence-specific polymers 
representing an alternative derivative of a peptide backbone. Structurally, tiiey differ fi-om 
polypeptides in tiiat their sidechains are pendant groups of tiie amide nitrogen rather than the a- 
carbon (see, e.g., Simon et al, 1992, Proc. Natl Acad. Set U.S.A. 89:9367-9371 and 
Zuckennann et al, 1992, J. Am. Chem. Soc. 114:10646-10647). **Retropeptoids" are believed to 
have a higher probability of bioactivity when protein binding is required, as the relative 
positioning of sidechains and carbonyls "line up" more closely with peptides (Kruijtzer, 1995, 
Tetrahedron Letters 36:6969-72). N-Substitution prevents proteolysis of the peptoid backbone 
(see, e.g.. Miller et al, 1995, Drug Dev Res 35:20-32), giving enhanced biostability. Since 
polypeptoids are not proteolyzed, they are not strongly immimogenic (Borman, 1998, CiScE 
News 76:56-57). 

[0172] In another variation, one can wish to constract a molecule that adopts a more 
"rigid" conformation; one means of accomplishing this would be to add methyl or other groups 
to the a carbon atom of the amino acids. 

[0173] As noted above, other groups besides a CH3 group can be added to the a carbon 
atom, that is, TGFp molecules of the present invention are not limited to those incorporating a 
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CH3 at the a carbon alone. For example, any of the side chains and molecules described above 
can be substituted for the indicated CH3 group at the a carbon component. 

[0174] As used herein, the tenns "analogs" and "derivatives" of polypeptides and 
amino acid residues are intended to encompass metabolites and catabolites of amino acids, as 
well as molecules which include linkages, backbones, side-chains or side-groups which differ 
from those ordinarily found in what are termed "naturally-occurring" L-form amino acids. (The 
terms "analog" and "derivative" can also conveniently be used interchangeably herein.). Thus, 
D-amino acids, molecules which mimic amino acids and amino acids with "designed" side 
chains (i.e., that can substitute for one or more amino acids in a molecule having TGPP activity) 
are also encompassed by the terms "analogs" and "derivatives" herein. 

[0175] A wide assortment of useful TGFP molecules, including amino acids having one 
or more extended or substituted R or R' groups, is also contemplated by the present invention. 

[01761 Again, one of skill in the art should appreciate that one can make a variety of 
modifications to individual amino acids, to the linkages, and/or to the chain itself, which 
modifications wiU produce molecules falling within the scope of the present iuvention, as long as 
the resulting molecule possesses TGFp activity as described herein. 

[01 77] Having now generally described the invention, the same will be more readily 
understood through reference to the following examples, which are provided by way of 
illustration and are not intended to be limiting of the present invention unless specified. 
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EXEMPLARY EMBODIMENTS 

EXAMPLE 1 

The design of the CHO TGPpi expression construct 
Materials and Methods 

[0178] Chemicals, reagents, and enzymes, Xma I and Bsp 1 191 can be obtained firom 
Fermetas (Hanover, MD). BamH I, Xho I and T4 ligase can be obtained j&om New England 
Biolabs (Beverly, MA). Pfu DNA polymerase can be obtained from Stratagene (Cedar Creek, 
TX). 

[01 79] Construction of the plasmids. An expression plasmid is constructed that contains 
a porcine TGFpl cDNA with several modifications. Porcine TGPpi cDNA encodes a precursor 
TGFplprotein that is 94% identical to the himian precursor TGPpi protein and 100% identical to 
human mature TGFpi The modifications include the introduction of a Cys to Ser amino acid 
substitution at amino acid residue 33, the replacement of the leader sequence in the porcine 
cDNA with the leader sequence from rat serum albumin (MKWVTFLLLL FISGSAFS) (SEQ ID 
NO:5), and the insertion of an 8-histidine tag immediately after the leader sequence. Since the 
tag is fused to the latency peptide, it does not affect the recombinant mature TGFpi amino acid 
sequence, but enables the use a Ni-NTA based metal affinity chromatography for easy 
purification of the recombinant pro-TGFpi polypeptide. An expression plasmid can be 
constmcted using pcDNA3. 1 (+) in which the neomycin gene is replaced with that of glutamine 
synthetase to enable the amplification of recombinant DNA. 

[0180] To constract a vector containing glutamine a synthetase (GS) cDNA sequence, a 
nucleic acid encoding Chinese hamster glutamine synthetase can be ampUfied by PGR using a 
CHO-Kl cDNA library (Stratagene, TX) as a template. Forward and reverse primers that can be 
used are as follows: 5'- GCG ATA CCC GGG TAT ACC ATG GCC ACC TCA GCA AGT-3' 
(SEQ ID NO:6) and 5'- CGG GTG TTCGAA TTA GTT TTT GTA TTG GAA GGG-3' (SEQ 
ID NO:7). Underlined nucleotides of the primers denote the Xma I and Bsp 1 191 sites 
respectively. The PCR product is digested with Xma I and Bsp 11 91 restriction enzymes and 
then ligated to the pcDNA3.1 (-f) plasmid (Invitrogen, CA), which has been previously digested 
with these two enzymes. Thus the neomycin gene in pcDNA3.1 (+) is replaced by GS ORF 
between Xma I and Bsp 1 191, permitting amplification of the vector using MSX. This vector, 
designated pcDNA-GS, is shown in Figure 2. 

[0181] The TGFpi cDNA, modified to contain the leader sequence from rat serum 
albumin (RS A), is then inserted into the pcDNA-GS vector to create a plasmid designated 
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pcDNA-GS-TGFpi. The modified TGPp 1 cDNA can be constructed as foUows. A porcine 
TGFpi cDNA , in which Cys 223 and Cys 225 were mutated to serines, can be obtained firom 
Dr. Seong-Jin Kim, National Institutes of Health, Maryland, and used as a template to amplify a 
TGFpl fi-agment in two PGR reactions using the following primers: TGFl - (5'- GGT TCT 
GCC TTT TCT CAC CAC CAT CAC CAC CAC CAT CAT CTG TCC ACC TGC AAG AC- 
3') (SEQ ID NO:8); TGF2 - (5'- TAG T CTC GAG TTA TCA GCT GCA CTT GCA GG-3') 
(SEQ ID NO:9); RSAl - (5'- AAA GGG GGA TCC GCC ACC ATG AAG TGG GTA ACC 
TTT CTC CTC CTC-3') (SEQ ID NO:10); RSA2 - (5'- AGA AAA GGC AGA ACC GGA 
GAT GAA GAG GAG GAG GAG AAA GGT TAC-3') (SEQ ID NO:ll). A first PCR reaction 
is conducted containing the TGFl, TGF2 primers and the porcine TGFp 1 cDNA. The product of 
the first PCR reaction is then used as a template for a second PGR reaction using the RS Al , 
RSA2 and TGF2 primers. The DNA segment amplified from the second PCR reaction encodes, 
from 5' end to 3' end, a rat serum albumin leader sequence, eight histidine residues and porcine 
pro-TGFpL The PCR product also contams a BamH I site at 5'end and an Xho I site at 3' end. 
The PCR product is then digested with BamH I and Xho I and then inserted between BamH I and 
Xho I sites of pcDNA-GS vector. 

[0182] The serine residues at position 223 and 225 of TGFpi are mutated back to 
cysteines with two oUgonucleotides, 5'- CGC CTC AGT GCC CAC TGT TCC TGT GAC AGC 
AAA GAT AAC-3' (SEQ ID NO:12) and 5'- GTT ATC TTT GCT GTC ACA GGA ACA GTG 
GGC ACT GAG GCG -3' (SEQ ID NO:13), and the QuikChange Site-Directed Mutagenesis kit 
(Stratagene, TX) A further mutation is introduced to replace Cys 33 of TGF|31 with serine using 
primers: 5'-GGA TCC CTG TCC ACC TCC AAG ACC ATC GAC ATG-3' (SEQ ID NO: 14) 
and 5'. CAT GTC GAT GGT CTT GGA GGT GGA CAG GGA TCC-3' (SEQ ID NO:15). The 
resulting expression vector is confirmed by DNA sequencing and designated pcDNA-GS- 
TGFpl. 

EXAMPLE 2 

Expression and purification of TGFpi 

Material and Methods 

[0183] TGFpi content and protein concentration assays. TGFpi content is measured 
by ELISA using a DuoSet ELISA Development kit (R & D Systems, Minneapolis, MN). A 
sample of 0-2fig of mouse anti-human TGPpi antibody is immobilized onto each well of ELISA 
plate. A chicken anti-human TGFpl antibody is used as the detection antibody at a 
concentration of 300ng/ml. Samples are acid activated prior to the assay, and the assay is 
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performed according to the instructioii manual. Protein concentration is determined by using a 
Bicinchoninic Acid (BCA) protein assay kit (Sigma-Aldrich, MO). 

[0184] SDS'PAGE and Western blot assays. SDS-PAGE is performed using a 
Homogeneous 20% PhastGel on PhastSystem (Amersham Biosciences, NJ) or a 12.5% SDS- 
polyacrylamide ready gel (Bio-Rad Laboratories, Hercules, CA). The gels are stained with 
Coomassie brilliant blue or silver, according to manufacturer's specifications, Westem blot 
assays are performed as described by Sambrook et aL, 1989, in Detection and analysis of 
proteins expressed from cloned genes, in Molecular Cloning: A laboratory Manual^ 2nd edition, 
18.64-18.70. A mouse anti-human TGFpl antibody (R&D Systems, Minneapolis, MN) at a 
concentration of 2pg/ml is the primary antibody and a goat anti-mouse IgG-horseradish 
peroxidase conjugate (Bio-Rad Laboratories, CA). at a 1:20000 dilution is the secondary 
antibody. The peroxidase substrate is 3-Amino-9-ethylcarbazole (Sigma-Aldrich, MO). 

Results 

[01 85] Expression of the modified TGFp 1 cDNA of example 1 above is achieved 
through introduction of the pcDNA-GS-TGpp plasmid into Chinese hamster ovary cells 
(hereinafter "CHO-cells"). Cells of the CHO cell line, CHO-lec 3.2.8.1, (available fi-om Dr. 
Pamela Stanley, Albert Einstem College of Medicine, NY) are trypsinized and seeded in a T-25 
flask. Prior to transfection, cells are grown under non-selective conditions in DMEM/F12 
(hivitrogen, Carlsbad, CA), containing 5% fetal bovine serum (FBS) for approximately twenty- 
four hours, until they were ahnost confluent. Immediately prior to transfection, the medium is 
replaced with 4ml fresh DMEM/F12 containing 5% FBS medium. A 10 \xg sample of flie 
plasmid, pcDNA-GS-TGFpl and 30|al aliquot of the Lipofectamine 2000 reagent (hivitrogen, 
CA) are each diluted mto 0.5ml OPTI-MEM I medium separately, and then the dilutions are 
combined to incubate at room temperature for 20 minutes. The mixture is then added to the 
CHO cells and the cells are incubated. After 24 hours, the medium is replaced with fresh 
medium. Two days post-transfection, the cells are trypsinized and seeded into ten 10cm culture 
dishes, and cultured in selection media (Glutamine-Free GMEM-S (JRH Biosciences, Lenexa, 
KS) supplemented with GS supplement, 5% Dialyzed FBS and 30|iM Methionine sulfoximine 
(MSX) (Sigma-Aldrich, St Louis, MO)). After about 3 weeks, colonies are transferred into 48- 
well plates containing 0.5ml selection medium. When the cultures reach confluency, 
supematants are taken and assayed for TGFpi by ELISA, The ten clones showing the highest 
expression levels are subjected to another round of selection by further spUtting of the clones 
into 10cm dishes containing 150pM MSX selection medium. A third round of selection is then 
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made in the presence of 500pM MSX. After three rounds of selection and amplification, the 

clone with highest TGFpi expression is chosen for large-scale recombinant protein production. 

[0186] To generate TGFpi containing medium, the chosen clone is transferred to a T- 
500 triple layer flasks after expansion and cultured in Glutamine-Free GMEM-S, supplemented 
with GS supplement, 5% Dialyzed FBS and 500|xM MSX. When the cultures reach confluency, 
cells are washed twice with Hank's solution and cultured with CHO-S-SFM n serum free 
medium (hivitrogen, Carlsbad, CA). After three to four days, the medium is harvested and 
replaced with fresh serum free medium. The concentration of mature TGFpi in the harvested 
medium, as determined by ELISA assay, is about 30 mg/liter. 

[0187] Due to the high and specific affinity of His-tag with Ni-NTA, the harvest 
meditmi can be directly loaded onto the Ni-NTA column without a prior concentration step. The 
harvested medium is filtered with a 0.22|im cellulose acetate filter and stored at -20°C for fixture 
use. A 20ml Ni-NTA agarose (Qiagen, Valencia, CA) column is equilibrated with a loading 
buffer (50mM Tris, 150mM sodium chloride, pH 8.0). Approximately 500ml harvested medimn 
is thawed and filtered agaia with 0.22|jm cellulose filter and then directly applied to the Ni-NTA 
column. After loading, the column is washed with 30 ml of the loading buffer and the proteiu is 
eluted by a linear concentration gradient of imidazole from 0.0 M to 0.3 M, with 45ml between 
the loading and elution buffer (50mM Tris, 150mM sodium chloride and IM imidazole, pH 8.0). 
After the first step of purification, the eluted protein mainly consists of pro-TGpp 1, giving rise to 
three major bands on SDS-PAGE that represented pro-TGFpi (the uppermost band), LAP (the 
middle band), and mature TGPpi (the lower band) (Figure 3, Panel A) Sequencing of the N- 
terminus of pro-TGFpi and LAP results in a peptide of HHHHHHHHLSTSKTIDMELV (SEQ 
ID NO: 16), indicating that the leader sequence is cleaved off right before the histidine tag. 

[0188] Ni-NTA affinity eluted TGFp 1 is pH adjusted to 3.0 to release the mature 
TGFP 1 (25 KD) from its latency protein LAP (60-75 KD). The mature TGFpi is then purified 
to homogeneity using a size exclusion column. The sample is concentrated two fold, filtered, 
and applied several times to a HiLoad 16/60 Superdex 200 prep grade column (Amersham 
Biosiences, Piscataway, NJ) with a running buffer (50mM Glycine, 50mM sodium chloride, pH 
4.0). Samples from gel filtration peak fractions are taken for analysis (Figure 3, Panels B and C, 
Figure 4). The elution profile shows that TGFpi eluted at a smaller apparent molecular weight 
(Figure 4). This phenomenon has also been observed by other groups and could be due to the 
hydrophobic interaction between TGPpi and the chromatographic matrix. (Gentry et al, 1988, 
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Molecular events in the processing of recombinant type 1 pre-pro-transfonning growth factor 
beta to the mature polypeptide, Mol CellBioL 8: 4162-4168). 

[0189] After only two steps of purification, lOmg purified TGFp 1 is obtained from 
500ml harvest medium. The identity of the purified mature TGFp 1 polypeptide is fiulher 
confiirmed by Westem blot assay (Figure 3, Panel D), N-teraiinal sequencing 
(ALDTNYCFSSTEKNCCVRQL) (SEQ ID NO: 17), and mass spectrometry (Mr. 25574.0). 

EXAMPLES 

Binding between TGFpl and its receptors 

[0190] The binding of TGFp 1 with the TGFp 1 Type E receptor (TpRII) is assessed in 
comparison the binding of commercial TGFpl. Two methods can be used this comparison: (1) 
an immunoassay technique and (2) surface plasmon-resonance measurements. 

(01911 For the immunoassay technique, Quantikine Human TGFp 1 Immunoassay kit 
(R&D Systems, Mmneapolis, MN) is employed. This kit has recombinant human TpRII coated 
on a substrate and provides a TGFpi standard. This test is performed according to the 
instruction protocol and the binding of TGFp 1 to the substrate is assessed in comparison witli the 
TGFpl standard. As shown in Figure 5, the results from this comparison show that the binding 
affinity of the purified TGPpi to TpRII is equal to that of the TGFpi standard. 

[01921 Surface plasmon resonance (SPR) measurements are performed using a BIAcore 
3000 instrument (BIAcore, Uppsala, Sweden). TGppi is immobihzed on two flow cells of a 
CMS sensor chip at concentrations of 0.7 nM and 1.4 respectively, in 50 mM sodium 
acetate, pH 5.0, using iV^hydrosucciimnide/l-ethyl-3 (-3-dimethylaminopropyl)-carbodiimide 
hydrochloride (NHS/EDC) at a flow rate of 20 p.l/min. 

[0193] TGFpi type n receptor (residues 22-136) is prepared essentially as described in 
Boesen, 2000, Protein Expr. Purif. 20:98-104. In brief, a 25 mg sample of the inclusion bodies 
is dissolved in 6M guanidine hydrochloride and then injected into a 1 L refolding solution (0.5 M 
arginine, 20mg/L AEBSF protease inhibitor, 2mM EDTA, 5mM cysteamine (reduced form), 0.5 
mM cystamine (oxidized form), and 0.1 M Tris, pHS.O) .The solution is stirred for 24 hours at 
10°C and then dialyzed against dialysis buffer (5mM Tris, pH8.5). After filtering, the refolding 
solution is loaded onto Source 15Q column (Amersham Biosiences, NJ), which is pre- 
equiUbrated with a ru nnin g buffer (5 mM Tris, pH8.5). Two major protein peaks are eluted by a 
linear gradient firom 0 to 1 M sodium chloride. The first peak fractions are pooled and 
concentrated and then purified with a Superdex 200 column. 
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[0194] Binding of the type H TGPP receptor (TpRH) to the immobilized TGFpl is 
measured using serial dilution of XPRII from 150 nM to 0.6 nM at a flow rate of 20 ^il/min. The 
running buffer and the analyte sample buffer are 10 mM Tris hydrochloride, pH 8.0. The chip 
surface is regenerated with 10 mM glycine, pH 3.0, at a flow rate of 30 pJ/min for 30 seconds, 
followed by buffer stabilization for 3 minutes. The apparent dissociation constant (Kd) is 
obtained from a linear regression of steady state 1/Response versus 1/C plots. Surface plasmon 
resonance measurements indicate an apparent dissociation constant (Ku) of TGFpl with human 
type n receptor was about 55nM (Figure 6 and Figure 7). 

EXAMPLE 4 

TGFpi Biological Activity 

[0195] In order to determine whether the purified TGFp 1 was biologically active, and 
to compare its activity Avith commercial TGFpl, a bioassay is conducted which assesses the 
ability of TGFp 1 to induce luciferase expression in mink-lung-epitheUal cells (MLEC) stably 
transfected with an expression construct comprising a truncated plasminogen activator inhibitor- 
1 promoter fused to the firefly luciferase reporter gene (Abe et al, 1994, Anal Biochem. 
216:276-284). 

[0196] Transfected MLECs are plated in 96-well tissue culture dishes at a concentration 
of 6 X 10"* cells per well. Cells are allowed to attach for 3 hours at 37*^0 in a 5% CO2 incubator 
and then the medium is replaced with test samples containing varying concentrations of TGFP 1 
diluted in DMEM + 0.1% BSA. After 14 hours at 37°C, cell extracts are prepared and assayed 
for luciferase activity using the enhanced luciferase assay kit (Analytical Luminescence, San 
Diego, CA) as per the manufacturer's instructions. Luciferase activity is reported as relative 
light units (RLtJ). CommCTcial TGFpi (R&D Systems, MN) is used as a comparison. The 
dose-response curves show that the biological activities of the purified TGFpi and cormnercial 
TGFpi are comparable (Figure 8). 

[0197] Although the foregoing invention has been described in detail by way of 
example for purposes of clarity of understanding, it will be ^parent to the artisan that certain 
changes and modifications are comprehended by the disclosure and can be practiced without 
xmdue experimentation within the scope of the appended claims, which are presented by way of 
illustration not limitation. 
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[0198] All publications and patent documents cited above are hereby incorporated by 
reference in their entirety for aU purposes to the same extent as if each were so individually 
denoted. 

[0199] Each recited range includes all combinations and sub-combinations of ranges, as 
well as specific numerals contained thereio. 
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